TRIPHOSPHATE OLIGONUCLEOTIDE MODIFICATION REAGENTS

AND USES THEREOF 

RELATED APPLICATIONS 

This application is related to co-owned U.S. utility application,
entitled "FUNCTIONAL BIOPOLYMER MODIFICATION REAGENTS AND
USES THEREOF", to Schwartz et al., filed August 1, 2000 (Attorney
Docket No. 37154-751), and to U.S. provisional application Serial No.
60/191,186, entitled "HYDRAZINE-BASED AND CARBONYL-BASED
BIFUNCTIONAL CROSSLINKING AGENTS", to Schwartz, filed March 22,
2000. The disclosures of the above-referenced applications are
incorporated herein by reference in their entirety.

FIELD OF THE INVENTION

Monomers and methods for preparation, detection and 
immobilization of macromolecules, including biopolymers, and for 
preparation and detection of macromolecular conjugates are provided. 
The monomers for use in the methods provided herein include hydrazino, 
oxyamino and carbonyl substituted nucleoside triphosphates.

Biopolymers, including oligonucleotides, possessing hydrazino, oxyamino 
or carbonyl modifications are also provided. 
BACKGROUND OF THE INVENTION 

Polymerase chain reaction (PCR) expression of oligonucleotides is a
powerful tool in molecular biology. The PCR product is generally labelled
to aid in detection. Various methods of direct and indirect labelling of the
PCR product have been developed. Direct labelling involves the
incorporation of a labelled monomer during enzymatic synthesis, while
indirect labelling refers to post-synthetic introduction of the label.

For example, methods for direct labelling of PCR products are
known (see, U.S. Patent No. 5,242,756 and International Patent
Application Publication No. WO 99/65993). In such methods, modified
nucleoside triphosphates are incorporated into oligonucleotides
during amplification, and conjugation of a xanthene or cyanine label to a
nucleoside triphosphate is effected via an amide bond formed with the
modified nucleoside triphosphates. In other methods, conjugation
of a biotin or metal chelating label to an oligonucleotide via an amide or
hydrazide bond is used to form the desired oligonucleotide-label conjugates
(see, U.S. Patent Nos. 4,707,440 and 4,889,798). Another method for
indirect labelling of oligonucleotides involves incorporation of a boronic
acid containing nucleoside triphosphate into an oligonucleotide during
enzymatic synthesis. Complexation of this boronic acid modified
oligonucleotide with a hydroxamic acid derivatized label provides the
desired labelled oligonucleotide (see, U.S. Patent No. 5,876,938).

These methods of oligonucleotide labelling, however, are limited by 
the lack of stability of the label to the amplification reaction conditions, 
the inability to selectively and specifically incorporate multiple labels, and 
the instability of succinimidyl esters. Preparation of activated 
functionalities, such as succinimidyl esters or maleimides, of labels can be 
costly, and not even possible in some instances, particularly under PCR 
conditions. 

Thus, due to the limitations of currently available methods as 
described above, there is a need for efficient methods for labelling of 
oligonucleotides. Therefore, it is an object herein to provide monomers 
and methods for labelling of oligonucleotides without the need for post- 
synthetic modification of the oligonucleotide. It is also an object herein 
to provide monomers and methods for enzymatic synthesis of modified 
biopolymers, including oligonucleotides, that can be specifically labelled. 
A further object herein is to provide the resulting modified 
oligonucleotides. 
SUMMARY OF THE INVENTION 

Oligonucleotide monomers containing hydrazino, oxyamino, or
carbonyl groups that can be incorporated into an oligonucleotide chain
during enzymatic oligonucleotide synthesis are provided. Methods for
immobilization and conjugation of biopolymer first components,
particularly oligonucleotides, containing hydrazino, oxyamino, or carbonyl
modifications are provided. The monomers are triphosphate nucleoside
derivatives that can be incorporated into an oligonucleotide first
component during enzymatic synthesis, including, but not limited to,
synthesis by polymerase chain reactions (PCR), reverse transcriptases,
including, but not limited to, AMV reverse transcriptase, MMLV reverse
transcriptase and SuperScript reverse transcriptase, and polymerases,
including, but not limited to, Taq polymerase, DNA polymerase, Klenow
fragment and T4 DNA polymerase. The resulting first components can
then be used for any purpose for which oligonucleotides are used. They
are particularly suitable for conjugation to a second component or
immobilization on a surface. The monomers provided herein
advantageously are readily incorporated into oligonucleotide chains, and
hence can be used in any application that involves or uses a nucleoside
triphosphate, such as DNA and RNA sequencing, detecting, labeling and
amplification methodologies. Monophosphate and diphosphate forms of
the monomers as well as nucleic acid chains containing the incorporated
monomers are provided.

The monomers provided herein are also useful as mass modifiers in
DNA sequencing by mass spectrometry (see, e.g., U.S. Patent Nos.
6,074,823 and 5,547,835). The monomers can be incorporated into
oligonucleotides for the accurate determination of base composition
(Muddiman et al. (1997) Anal. Chem. 69:1543), and for the scoring of
single nucleotide polymorphisms (SNPs) (Chen et al. (1999) Anal. Chem.
71:3118). The monomers can also be used to study the mechanisms by
which ribozymes effect catalytic cleavage (Earnshaw et al. (2000)
Biochemistry 39:6410). The monomers can be incorporated into
antisense oligonucleotides to increase their resistance to enzymatic
degradation (Verheijen et al. (2000) Bioorg. Med. Chem. Lett. 10:801),
their overall potency (Flanagan et al. (1999) Proc. Natl. Acad. Sci. USA
96:3513) and the stability of their hybrids with the complementary RNA
sequences (Compagno et al. (1999) J. Biol. Chem. 274:8191).

The monomers possess a triphosphate-ribose-nucleobase motif for
recognition by the enzymatic catalyst.

Riboses for use in the monomers and methods herein are well
known to those of skill in the art, and include, but are not limited to, fully
hydroxylated sugars such as ribose, deoxyriboses such as
2-deoxyriboses, and dideoxyriboses such as 2,3-dideoxyribose.

Nucleobases for use in the monomers and methods herein are also
well known to those of skill in the art, and include, but are not limited to,
cytosines, uracils, adenines, guanines and thymines, and analogs thereof,
including deaza analogs.

The monomers also possess, in addition to the triphosphate group,
a protected or unprotected hydrazino, protected or unprotected oxyamino
(-O-NH2), or carbonyl moiety for formation of a hydrazone or oxime
linkage with an appropriately modified surface or second component.

The hydrazino moiety can be an aliphatic, aromatic or heteroaromatic
hydrazine, semicarbazide, carbazide, hydrazide, thiosemicarbazide,
thiocarbazide, carbonic acid dihydrazine or hydrazine carboxylate (see,
FIGURE 1). The protecting groups are salts of the hydrazino or oxyamino
group, including but not limited to, mineral acid salts, such as but not
limited to hydrochlorides and sulfates, and salts of organic acids, such as
but not limited to acetates, lactates, malates, tartrates, citrates,
ascorbates, succinates, butyrates, valerates and fumarates, or any amino
or hydrazino protecting group known to those of skill in the art (see, e.g.,
Greene et al. (1999) Protective Groups in Organic Synthesis (3rd Ed.) (J.
Wiley & Sons, Inc.)). The carbonyl moiety can be any carbonyl containing
group capable of forming a hydrazone or oxime linkage with one or more
of the above hydrazino or oxyamino moieties. Preferred carbonyl
moieties include aldehydes and ketones.

Second components include, but are not limited to, macromolecules,
biopolymers as defined herein, polymers including, but not limited to,
polyamines, polyamides, polyethers and polyethylene glycols, and other
compounds of interest herein for use in assays, kits, diagnostic arrays,
and the like, including, but not limited to, intercalators, vitamins, reporter
molecules, cholesterols, phospholipids, biotin, phenazine, phenanthridine,
anthraquinone, acridine, fluoresceins, rhodamines, coumarins, dyes,
antibodies, haptens, antigens, enzymes, and detection reagents including,
but not limited to, fluorophores, metals including, but not limited to, gold,
metal chelates, chromophores, fluorophore precursors and chromophore
precursors, that possess or are modified to possess a hydrazino,
oxyamino or carbonyl group that is complementary to the carbonyl,
oxyamino or hydrazino group of the oligonucleotide of formula (II) for
formation of hydrazone or oxime linkage. Dendrimeric compounds that
possess a plurality of detectable groups, including, but not limited to,
reporter molecules, fluorophores, chromophores, fluorophore precursors
and chromophore precursors, are also contemplated herein as second
components. Fluorophore precursors and chromophore precursors are
compounds that react with the hydrazino, oxyamino or carbonyl group of
the modified oligonucleotide to form a fluorogenic or chromogenic group
for analysis. Such groups are preferred due to the absence of
background noise in the resulting assay. Preferred conjugates include
those containing a hydrazone linkage.

In one embodiment, the monomers for use in the methods provided
herein are capture nucleoside triphosphates (cNTPs) that have formula (I):

P^1-S^1-B^1-M-X

or a derivative thereof, as defined herein, where P^1 is a triphosphate
group, as defined herein; S^1 is a ribose, a deoxyribose or a dideoxyribose;
B^1 is a nucleobase; X is a protected or unprotected hydrazino group, a
protected or unprotected oxyamino group, or a carbonyl derivative, where
the protecting group is a salt or any amino or hydrazino protecting group
known to those of skill in the art; and M is a divalent group having any
combination of the following groups, which can be combined in any
order: arylene, heteroarylene, cycloalkylene, C(R^1)_2, -C(R^1)=C(R^1)-,
>C=C(R^2)(R^3), >C(R^2)(R^3), -C≡C-, O, S(A)_a, P(D)_b(R^1), P(D)_b(ER^1),
N(R^1), >N^+(R^2)(R^3) and C(E); where a is 0, 1 or 2; b is 0, 1, 2 or 3; A is
O or NR^1; D is S or O; and E is S, O or NR^1; each R^1 is a monovalent group
independently selected from hydrogen and M^1-R^4; each M^1 is a divalent
group independently having any combination of the following groups,
which groups can be combined in any order: a direct link, arylene,
heteroarylene, cycloalkylene, C(R^5)_2, -C(R^5)=C(R^5)-, >C=C(R^2)(R^3),
>C(R^2)(R^3), -C≡C-, O, S(A)_a, P(D)_b(R^5), P(D)_b(ER^5), N(R^5), N(COR^5),
>N^+(R^2)(R^3) and C(E); where a is 0, 1 or 2; b is 0, 1, 2 or 3; A is O or
NR^5; D is S or O; and E is S, O or NR^5; R^4 and R^5 are each independently
selected from among hydrogen, halo, pseudohalo, cyano, azido, nitro,
SiR^6R^7R^8, alkyl, alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl,
aralkenyl, aralkynyl, heteroaryl, heteroaralkyl, heteroaralkenyl,
heteroaralkynyl, heterocyclyl, heterocyclylalkyl, heterocyclylalkenyl,
heterocyclylalkynyl, hydroxy, alkoxy, aryloxy, aralkoxy, heteroaralkoxy
and NR^9R^10; R^9 and R^10 are each independently selected from hydrogen,
alkyl, alkenyl, alkynyl, cycloalkyl, aryl, aralkyl, heteroaryl, heteroaralkyl
and heterocyclyl; R^2 and R^3 are selected from (i) or (ii) as follows: (i) R^2
and R^3 are independently selected from among hydrogen, alkyl, alkenyl,
alkynyl, cycloalkyl, aryl and heteroaryl; or (ii) R^2 and R^3 together form
alkylene, alkenylene or cycloalkylene; R^6, R^7 and R^8 are each
independently a monovalent group selected from hydrogen, alkyl, alkenyl,
alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl, aralkenyl, aralkynyl,
heteroaryl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl, heterocyclyl,
heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, hydroxy,
alkoxy, aryloxy, aralkoxy, heteroaralkoxy and NR^9R^10; and

R^1, R^2, R^3, R^4, R^5, R^6, R^7, R^8, R^9 and R^10 are optionally substituted
with one or more substituents each independently selected from Z,
wherein Z is selected from alkyl, alkenyl, alkynyl, aryl, cycloalkyl,
cycloalkenyl, hydroxy, S(O)_hR^20, NR^20R^21, COOR^20, COR^20, CONR^20R^21,
OC(O)NR^20R^21, N(R^20)C(O)R^21, alkoxy, aryloxy, heteroaryl, heterocyclyl,
heteroaryloxy, heterocyclyloxy, aralkyl, aralkenyl, aralkynyl, heteroaralkyl,
heteroaralkenyl, heteroaralkynyl, aralkoxy, heteroaralkoxy,
alkoxycarbonyl, carbamoyl, thiocarbamoyl, alkoxycarbonyl, carboxyaryl,
halo, pseudohalo, haloalkyl and carboxamido; h is 0, 1 or 2; and R^20 and
R^21 are each independently selected from among hydrogen, halo,
pseudohalo, cyano, azido, nitro, trialkylsilyl, dialkylarylsilyl, alkyldiarylsilyl,
triarylsilyl, alkyl, alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl,
aralkenyl, aralkynyl, heteroaryl, heteroaralkyl, heteroaralkenyl,
heteroaralkynyl, heterocyclyl, heterocyclylalkyl, heterocyclylalkenyl,
heterocyclylalkynyl, hydroxy, alkoxy, aryloxy, aralkoxy, heteroaralkoxy,
amino, amido, alkylamino, dialkylamino, alkylarylamino, diarylamino and
arylamino.
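
The modular layout of formula (I) can be pictured with a short Python
sketch. This is only an illustration under stated assumptions: the class
name CaptureNTP, its field names and the free-text linker description are
hypothetical and do not appear in the disclosure. The allowed sugar and
reactive-group values are taken from the definitions of the sugar and of X
given for formula (I), and the chain-terminator check reflects the use of
dideoxynucleotides as chain terminators.

```python
from dataclasses import dataclass

# Allowed values taken from the definitions of the sugar and of X in formula (I).
SUGARS = {"ribose", "deoxyribose", "dideoxyribose"}
REACTIVE_GROUPS = {"hydrazino", "oxyamino", "carbonyl"}


@dataclass(frozen=True)
class CaptureNTP:
    """Hypothetical record for a formula (I) monomer (triphosphate implied)."""

    sugar: str               # S1: ribose, deoxyribose or dideoxyribose
    nucleobase: str          # B1, e.g. "uracil"
    linker: str              # M: free-text description of the divalent group
    reactive_group: str      # X: hydrazino, oxyamino or carbonyl
    protected: bool = False  # formula (I) names protection only for hydrazino/oxyamino

    def __post_init__(self):
        if self.sugar not in SUGARS:
            raise ValueError(f"unknown sugar: {self.sugar}")
        if self.reactive_group not in REACTIVE_GROUPS:
            raise ValueError(f"unknown reactive group: {self.reactive_group}")
        if self.protected and self.reactive_group == "carbonyl":
            raise ValueError("protection is described only for hydrazino and oxyamino groups")

    def is_chain_terminator(self) -> bool:
        # A dideoxyribose lacks the 3'-hydroxyl needed for further chain extension.
        return self.sugar == "dideoxyribose"
```

For instance, CaptureNTP("dideoxyribose", "adenine", "arylene", "carbonyl")
would describe a chain-terminating carbonyl monomer in this sketch.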

Macromolecules, particularly biopolymers, including
oligonucleotides, that are modified by incorporation of the above
monomers are provided. Thus, in certain embodiments, provided herein
are oligonucleotide analogs of formula (II):

[Formula (II): structure comprising O^1, P^2, S^1, O^2, B^1, M and X]

or a derivative thereof, where O^1 and O^2 are each independently
oligonucleotides or analogs thereof, such as peptide nucleic acids (PNAs);
P^2 is a phosphodiester group, resulting from the coupling of a compound
of formula (I) with O^1 or analog thereof; and S^1, B^1, M and X are selected
as above.

Such oligonucleotide analogs are useful as antisense drugs,
cis-elements acting as regulators of gene expression, and substrates for RNA
binding proteins such as HIV-1 Rev and Tat (see, e.g., Agrawal (1999)
Biochim. Biophys. Acta 1489:53; Crooke (1999) Biochim. Biophys. Acta
1489:31; Gryaznov (1999) Biochim. Biophys. Acta 1489:131; and
Morishita (2000) Nipp. Yakuri. Zass. 115:123).

Oligonucleotides covalently immobilized on a solid surface, as
described herein, are also provided. The oligonucleotides are of formula
(II), and are immobilized through a covalent hydrazone or oxime linkage.
The solid surface is modified to possess a hydrazino, oxyamino, or
carbonyl group that is complementary to the hydrazino, oxyamino or
carbonyl moiety of the oligonucleotide of formula (II) for formation of a
hydrazone or oxime linkage. In preferred embodiments, the
oligonucleotides are immobilized through a covalent hydrazone linkage.

The immobilized oligonucleotides, as well as non-immobilized
forms, are useful for a variety of purposes known to those of skill in the
art, including, but not limited to, preparation of microarrays, diagnostic
probe assays, DNA or RNA amplification, for example by solid phase
polymerase chain reactions (PCR), molecular computing (see, e.g.,
Adleman (1994) Science 266:1021-1024; Kari (1997) Mathematical
Intelligencer 19:9-22; Frutos et al. (1997) Nucleic Acids Res. 25:4748;
Smith et al. (1998) J. Comp. Biol. 5:255; Liu et al. (1998) J. Comp. Biol.
5:267; Frutos et al. (1998) J. Am. Chem. Soc. 120:10277; Wang et al.
(1999) Biosystems 52:189-191; Liu et al. (1999) Biosystems 52:25-33;
Liu et al. (2000) Nature 403:175-179; European Patent Application
Publication No. EP 0 772 135; Reed et al. (June 2000) Scientific
American:86-93), molecular addressing (Niemeyer et al. (1994) Nucl.
Acids Res. 22(25):5530-5539), DNA and RNA sequencing methods,
including mass spectrometry methods (see, e.g., U.S. Patent Nos.
6,074,823 and 5,547,835), nucleic acid diagnostics, including SNP and
other polymorphism analyses and detection methods, and in studying the
molecular electronics of DNA (see, e.g., U.S. Patent Nos. 6,071,699,
6,066,448, 5,952,172 and 5,824,473).

The oligonucleotides can also be used in PCR-based sequencing
methods, particularly solid phase sequencing methods using the
immobilized oligonucleotides (Mustajoki et al. (1997) Genome Res.
7:1054). They can also be used to measure the interaction forces
between single strands of DNA (Lee et al. (1994) Science 266:771), in
solid phase-mediated transfection of oligonucleotides (Bielinska et al.
(2000) J. Biomaterials 21:877), and in solid phase cloning to create
libraries suitable for direct solid phase sequencing (Hultman et al. (1994)
J. Biotechnol. 35:229). The immobilized oligonucleotides can also be
used in DNA chip technology to create arrays of oligonucleotides which
are used to compare the qualitative and quantitative characteristics of
gene expression profiles, mutations, insertions and deletions in normal
and diseased states (De Benedetti et al. (2000) Int. J. Biol. Markers
15:1). They can also be used to identify and characterize the DNA
binding site of DNA binding proteins (Roth et al. (1995) EMBO J.
14:2106; Carlsson et al. (1995) Anal. Biochem. 232:172). Immobilized
oligonucleotides bound to Sephacryl S-500 particles via a CNBr-activation
procedure can be used to assemble extended DNA duplexes by
phosphorylation, ligation and restriction enzyme digestion of assemblies
of annealed oligonucleotides in solid phase (Hostomsky et al. (1987)
Nucleic Acids Symp. Ser. 18:241). The immobilized oligonucleotides can
also be used in PCR, RT-PCR (Kozwich et al. (2000) Appl. Environ.
Microbiol. 66:2711; Blomqvist et al. (1999) J. Clin. Microbiol. 37),
transcription (Marble et al. (1995) Biotechnol. Prog. 11:393; Fujita et al.
(1993) Biotechniques 14:608), ligation reactions (Filippov et al. (1990)
Bioorg. Khim. 16:1045), and in studying DNA repair mechanisms (Salles
et al. (1999) Biochimie 81:53).

In embodiments where X of the modified oligonucleotide is a
hydrazino or oxyamino group, the solid surface can be modified to
possess an epoxide, α-bromocarbonyl, maleimide, maleic anhydride,
isothiocyanate or isocyanate group. Such solid surfaces can be prepared
by methods provided herein or other methods well known to those of skill
in the art. For example, reaction of pentafluorophenyl
4-isothiocyanatobenzoate with an amino solid surface results in formation
of an isothiocyanato modified solid surface. Some of these surfaces are
commercially available from, e.g., Pierce (Rockford, Illinois), SINTEF
Applied Chemistry (Trondheim, Norway), Rapp Polymere GmbH
(Tübingen, Germany), and Dyno Particles AS (Trondheim, Norway).
Reaction of the hydrazino or oxyamino group of the modified
oligonucleotide with the epoxide, α-bromocarbonyl, maleimide, maleic
anhydride, isothiocyanate or isocyanate group of the solid surface results
in covalent attachment of the oligonucleotide to the solid surface.

In certain embodiments, particularly where X is an oxyamino group, 
the immobilized oligonucleotides are selected such that the solid surface
is not modified with an aldehyde or epoxide group. 

Oligonucleotide conjugates are provided. The conjugates are
prepared from the modified oligonucleotide first components of formula
(II). The modified oligonucleotide of formula (II) is reacted with a
complementary derivative of a second component to form a hydrazone or
oxime covalent linkage. The second components include, but are not
limited to, biopolymers as defined herein, polymers including, but not
limited to, polyamines, polyamides, polyethers and polyethylene glycols,
and other compounds of interest herein for use in assays, kits, diagnostic
arrays, and the like, including, but not limited to, intercalators, vitamins,
reporter molecules, cholesterols, phospholipids, biotin, phenazine,
phenanthridine, anthraquinone, acridine, fluoresceins, rhodamines,
coumarins, dyes, antibodies, haptens, antigens, enzymes, and detection
reagents including, but not limited to, fluorophores, metals including, but
not limited to, gold, metal chelates, chromophores, fluorophore
precursors and chromophore precursors, that possess or are modified to
possess a hydrazino, oxyamino or carbonyl group that is complementary
to the carbonyl, oxyamino or hydrazino group of the oligonucleotide of
formula (II) for formation of hydrazone or oxime linkage. Dendrimeric
compounds that possess a plurality of detectable groups, including, but
not limited to, reporter molecules, fluorophores, chromophores,
fluorophore precursors and chromophore precursors, are also
contemplated herein as second components. Fluorophore precursors and
chromophore precursors are compounds that react with the hydrazino,
oxyamino or carbonyl group of the modified oligonucleotide to form a
fluorogenic or chromogenic group for analysis. Such groups are preferred
due to the absence of background noise in the resulting assay. Preferred
conjugates include those containing a hydrazone linkage.

In embodiments where X is a hydrazino or oxyamino group, the
second component can be modified to possess an epoxide,
α-bromocarbonyl, maleimide, maleic anhydride, isothiocyanate or isocyanate
group. Such second components can be prepared by methods provided
herein or by other methods well known to those of skill in the art. For
example, reaction of pentafluorophenyl 4-isothiocyanatobenzoate with an
amino or hydroxy group of a second component, such as a protein or
oligosaccharide, results in formation of an isothiocyanato modified second
component. Reaction of the hydrazino or oxyamino group of the modified
oligonucleotide with the epoxide, α-bromocarbonyl, maleimide, maleic
anhydride, isothiocyanate or isocyanate group of the second component
results in covalent attachment of the oligonucleotide to the second
component to form the conjugates provided herein.

In certain embodiments herein, the oligonucleotide conjugates are
selected with the proviso that the covalent linkage is not an acyl
hydrazone (-C(O)NHN=). In preferred embodiments, the oligonucleotide
conjugates are prepared from modified oligonucleotides of formula (II)
where P^2 is not a phosphoramidate. In other embodiments, the
oligonucleotide conjugates are selected with the proviso that the second
component is not a protein, particularly a glycoprotein, more particularly
an immunoglobulin.

The modified oligonucleotides provided herein can also be
conjugated to ligands such as growth factors, membrane-active bacterial
proteins and asialoglycoproteins as delivery strategies for antisense
therapy and gene therapy (see, e.g., Perales et al. (1994) Eur. J.
Biochem. 226:255; Cristiano et al. (1996) Cancer Gene Ther. 3:4;
Hoganson et al. (1998) Hum. Gene Ther. 9:2565; Gottschalk et al.
(1995) Gene Ther. 2:498; Lu et al. (1994) J. Nucl. Med. 35:269).
Oligonucleotide-antigen conjugates can be used as immunomodulators in
regulating airway eosinophilia in bronchial asthma (Shirota et al. (2000) J.
Immunol. 164:5575). Oligonucleotide probes conjugated to fluorescent
molecules are used in fluorescence in situ hybridization (FISH) for
chromosome classification and the detection of chromosome aberrations
(Pinkel et al. (1986) Proc. Natl. Acad. Sci. USA 83:2934).
Oligonucleotide conjugates are also used to study the mechanics of DNA
hybridization, and to investigate protein-DNA contacts of DNA binding
proteins (Lannutti et al. (1996) Biochemistry 35:9821; Brown et
al. (1997) Curr. Opin. Biotechnol. 8:45).

Importantly, the immobilized oligonucleotides or oligonucleotide
conjugates provided herein can be formed under aqueous conditions
without the need for additional reagents, such as a reducing agent.

Methods for the attachment of hydrazino, oxyamino, or carbonyl
modified oligonucleotides of formula (II) to appropriately modified
surfaces, as described herein, are provided. The attachment is via a
covalent hydrazone or oxime bond formed from the hydrazino, oxyamino,
or carbonyl group of the modified oligonucleotide and a complementary
carbonyl, oxyamino or hydrazino modified surface provided herein (see,
FIGURE 2).

Methods for the conjugation of oligonucleotide first components to
second components, including, but not limited to, biopolymers as defined
herein, polymers including, but not limited to, polyamines, polyamides,
polyethers and polyethylene glycols, and other compounds of interest
herein for use in assays, kits, diagnostic arrays, and the like, including,
but not limited to, intercalators, vitamins, reporter molecules,
cholesterols, phospholipids, biotin, phenazine, phenanthridine,
anthraquinone, acridine, fluoresceins, rhodamines, coumarins, dyes,
antibodies, haptens, antigens, enzymes, and detection reagents including,
but not limited to, fluorophores, metals including, but not limited to, gold,
metal chelates, chromophores, fluorophore precursors and chromophore
precursors, that possess or are modified to possess a hydrazino,
oxyamino or carbonyl group that is complementary to the carbonyl,
oxyamino or hydrazino group of the oligonucleotide of formula (II) for
formation of hydrazone or oxime linkage, are provided. Dendrimeric
compounds that possess a plurality of detectable groups, including, but
not limited to, reporter molecules, fluorophores, chromophores,
fluorophore precursors and chromophore precursors, are also
contemplated herein as second components. Fluorophore precursors and
chromophore precursors are compounds that react with the hydrazino,
oxyamino or carbonyl group of the modified oligonucleotide to form a
fluorogenic or chromogenic group for analysis.

In particular, methods for conjugation of a hydrazino, oxyamino, or
carbonyl modified oligonucleotide of formula (II) with an appropriately
modified second component are provided. The conjugation is achieved
through a covalent hydrazone or oxime bond formed from the hydrazino,
oxyamino or carbonyl group of the modified oligonucleotide and a second
component possessing a complementary carbonyl, oxyamino or hydrazino
group.

In all embodiments herein, it is preferred that one, more preferably
both, of the reactive partners (e.g., the hydrazino, oxyamino or carbonyl
groups) are aromatic or heteroaromatic. Thus, in preferred embodiments,
the compounds of formula (I) will be aryl or heteroaryl hydrazino or
oxyamino derivatives, or aryl or heteroaryl carbonyl derivatives. In more
preferred embodiments, the coupling partner (e.g., the modified solid
surface or the second component) will also possess an aryl or heteroaryl
hydrazino or oxyamino group, or an aryl or heteroaryl carbonyl group.
Hydrazone and oxime linkages formed from these preferred groups are
more stable than the corresponding aliphatic hydrazone and oxime
linkages, and thus are more preferred in certain applications.

A composition comprising one or more nucleoside triphosphates
and a monomer provided herein is also provided, as are methods using the
composition to synthesize nucleic acid molecules that include the
monomers provided herein incorporated into the chains. Nucleic acid
molecules (i.e., oligonucleotides) in which the monomers provided herein
are at the end are also provided; these are produced by providing the
monomers as chain terminators, such as dideoxynucleotides.
DESCRIPTION OF THE FIGURES 

FIGURE 1 illustrates incorporation of the capture nucleoside
triphosphate (cNTP) monomers provided herein into an RNA/DNA target
to form a modified RNA/DNA and capture of the modified RNA/DNA on a
solid surface.

FIGURE 2 illustrates hydrazino and oxyamino derivatives provided 
herein. 

FIGURE 3 illustrates carbonyl groups exemplified herein.

FIGURE 4 shows the immobilization of a hydrazino or oxyamino 
modified oligonucleotide on a carbonyl modified solid support. 

FIGURE 5 shows the synthesis of a carbonyl (aldehyde) 
triphosphate monomer provided herein. 

FIGURE 6 shows the synthesis of a hydrazino modified fluorescein 
provided herein. 

FIGURE 7 illustrates the synthesis of a hydrazino triphosphate 
monomer provided herein. 

FIGURE 8 illustrates the synthesis of an oxyamino triphosphate
monomer provided herein.

FIGURE 9 illustrates the modified fluorescent groups F* provided
herein.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Definitions 

Unless defined otherwise, all technical and scientific terms used 
herein have the same meaning as is commonly understood by one of skill 
in the art to which this invention belongs. All patents, patent 
applications and publications referred to throughout the disclosure herein 
are incorporated by reference in their entirety. In the event that there are 
a plurality of definitions for a term herein, those in this section prevail.

As used herein, "hydrazino groups" include, but are not limited to, 
hydrazines, hydrazides, semicarbazides, carbazides, thiosemicarbazides, 
thiocarbazides, hydrazine carboxylates and carbonic acid hydrazines (see, 
e.g., FIGURE 2).

As used herein, hydrazone linkages include, but are not limited to, 
hydrazones, acyl hydrazones, semicarbazones, carbazones, 
thiosemicarbazones, thiocarbazones, hydrazone carboxylates and 
carbonic acid hydrazones. 
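
The correspondence between the two definitions above can be written out
as a lookup table. The following Python sketch assumes that the hydrazino
groups and the hydrazone linkages are listed in matching order (hydrazine
with hydrazone, hydrazide with acyl hydrazone, and so on); that pairing,
the table name HYDRAZONE_LINKAGE and the helper linkage_for are
illustrative assumptions rather than part of the definitions.

```python
# Pairing assumed from the parallel order of the two definition lists.
HYDRAZONE_LINKAGE = {
    "hydrazine": "hydrazone",
    "hydrazide": "acyl hydrazone",
    "semicarbazide": "semicarbazone",
    "carbazide": "carbazone",
    "thiosemicarbazide": "thiosemicarbazone",
    "thiocarbazide": "thiocarbazone",
    "hydrazine carboxylate": "hydrazone carboxylate",
    "carbonic acid hydrazine": "carbonic acid hydrazone",
}


def linkage_for(hydrazino_group: str) -> str:
    """Return the hydrazone-type linkage expected from a hydrazino group."""
    key = hydrazino_group.strip().lower()
    if key not in HYDRAZONE_LINKAGE:
        raise KeyError(f"not a listed hydrazino group: {hydrazino_group!r}")
    return HYDRAZONE_LINKAGE[key]
```

Because both lists are introduced with "include, but are not limited to",
the table is not exhaustive; unlisted groups raise a KeyError in this sketch.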

As used herein, an oxyamino group has the formula -O-NH2. An
oxime has the formula -O-N=R.

As used herein, carbonyl-based derivatives include, but are not 
limited to, ketones and aldehydes. 

As used herein, a "triphosphate group" refers to 
(HO)P(O)(O)-O-P(O)(O)-O-P(O)(O)-.

As used herein, a protected hydrazino or a protected oxyamino 
group refers to a hydrazino or oxyamino group that has been derivatized 
as a salt of the hydrazino or oxyamino group, including but not limited to, 
mineral acid salts, such as but not limited to hydrochlorides and sulfates,
and salts of organic acids, such as but not limited to acetates, lactates, 
malates, tartrates, citrates, ascorbates, succinates, butyrates, valerates 
and fumarates, or any amino or hydrazino protecting group known to
those of skill in the art (see, e.g., Greene et al. (1999) Protective Groups
in Organic Synthesis (3rd Ed.) (J. Wiley & Sons, Inc.)). Preferred amino
and hydrazino protecting groups herein include, but are not limited to,
amino or hydrazino protecting groups useful in the synthesis of
oligonucleotides, more preferably monomethoxytrityl (MMT),
dimethoxytrityl (DMT), 9-fluorenylmethoxycarbonyl (FMOC), acetyl,
trifluoroacetyl (TFA), benzoyl, or a hydrazone that is cleaved under mild
acidic conditions (e.g., 100 mM acetate, pH 4.5-5.5) including, but not
limited to, a hydrazone formed from a lower aliphatic aldehyde or ketone,
preferably from acetone, cyclohexanone, propanal or 2-butanone.

As used herein, an oligonucleotide is a nucleic acid, including, but 
not limited to, a ribonucleic acid (RNA), a deoxyribonucleic acid (DNA), 
and analogs thereof such as a peptide nucleic acid (PNA), of any length, 
including chromosomes and genomic material, such as PCR products or 
sequencing reaction products, preferably DNA including double and single 
stranded forms. Single stranded forms of the oligonucleotides are also 
provided. 

As used herein, a conjugate is a compound containing two 
components covalently linked together. For example, a first component, 
e.g., an oligonucleotide, is conjugated through a covalent hydrazone 
linkage to a second component, as defined herein, to form a conjugate. 

As used herein, a biopolymer is any compound found in nature, or 
derivatives thereof, made up of monomeric units. Biopolymers include, 
but are not limited to, oligonucleotides, peptides, peptide nucleic acids 
(PNAs), glycoproteins and oligosaccharides. Thus, the monomeric units 
include, but are not limited to, nucleotides, nucleosides, amino acids, 
PNA monomers, monosaccharides, and derivatives thereof. 

As used herein, a macromolecule refers to a molecule of colloidal 
size (i.e., of high molecular weight), including, but not limited to, 
proteins, polynucleic acids, polysaccharides and carbohydrates. 

As used herein, complementary reactive groups are those that, 
when reacted together, form a covalent linkage, including, but not limited 
to, a hydrazone or oxime linkage. Thus, a hydrazino group, as defined 
herein, is complementary to a carbonyl group. An oxyamino group is also 
complementary to a carbonyl group. 
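
By way of illustration only, the pairing of complementary reactive groups can be sketched as a small lookup table. The Python names used here (ReactiveGroup, linkage_formed) are hypothetical and form no part of the disclosure; the sketch models only the hydrazone and oxime pairings named above, although the definition itself is not limited to them.

```python
from enum import Enum


class ReactiveGroup(Enum):
    """Reactive groups named in the definition (illustrative labels only)."""
    HYDRAZINO = "hydrazino"
    OXYAMINO = "oxyamino"
    CARBONYL = "carbonyl"


# Linkage formed by each complementary pair. The pair is stored as a
# frozenset so the order in which the two groups are given does not matter.
_LINKAGES = {
    frozenset({ReactiveGroup.HYDRAZINO, ReactiveGroup.CARBONYL}): "hydrazone",
    frozenset({ReactiveGroup.OXYAMINO, ReactiveGroup.CARBONYL}): "oxime",
}


def linkage_formed(first, second):
    """Return the linkage name for a complementary pair, or None otherwise."""
    return _LINKAGES.get(frozenset({first, second}))
```

Under these assumptions, a pair that is not complementary in the sense defined above, such as two hydrazino groups, simply yields no linkage.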

As used herein, a reporter molecule refers to a molecule, such as 
an enzyme or indicator, which is capable of generating a detectable signal 
(e.g., by colorimetric, chemiluminescent, bioluminescent, fluorescent, or 
potentiometric means) when contacted with a suitable substrate under 
appropriate reaction conditions. Exemplary reporter enzymes include, but 
are not limited to, alkaline phosphatase, horseradish peroxidase, 
β-galactosidase, aryl esterase, sulfatase and urease. 

As used herein, a nucleobase is a heterocyclic moiety that is found 
in naturally occurring oligonucleotides, including ribonucleic acids (RNA) 
and deoxyribonucleic acids (DNA), and analogs thereof, including deaza 
analogs. Preferred nucleobases include, but are not limited to, cytosines, 
uracils, adenines, guanines and thymines, and analogs thereof including 
deaza analogs. 

As used herein, a derivative of a compound includes a salt, ester, 
enol ether, enol ester, solvate or hydrate thereof that can be readily 
prepared by those of skill in this art using known methods for such 
derivatization. Salts include, but are not limited to, amine salts, such as 
but not limited to N,N'-dibenzylethylenediamine, chloroprocaine, choline, 
ammonia, diethanolamine and other hydroxyalkylamines, ethylenediamine, 
N-methylglucamine, procaine, N-benzylphenethylamine, 
1-para-chlorobenzyl-2-pyrrolidin-1'-ylmethylbenzimidazole, diethylamine and other 
alkylamines, piperazine and tris(hydroxymethyl)aminomethane; alkali 
metal salts, such as but not limited to lithium, potassium and sodium; 
alkaline earth metal salts, such as but not limited to barium, calcium and 
magnesium; transition metal salts, such as but not limited to zinc; and 
other metal salts, such as but not limited to sodium hydrogen phosphate 
and disodium phosphate; and also including, but not limited to, salts of 
mineral acids, such as but not limited to hydrochlorides and sulfates; and 
salts of organic acids, such as but not limited to acetates, lactates, 
malates, tartrates, citrates, ascorbates, succinates, butyrates, valerates 
and fumarates. Esters include, but are not limited to, alkyl, alkenyl, 
alkynyl, aryl, heteroaryl, aralkyl, heteroaralkyl, cycloalkyl and heterocyclyl 
esters of acidic groups, including, but not limited to, carboxylic acids, 
phosphoric acids, phosphinic acids, sulfonic acids, sulfinic acids and 
boronic acids. Enol ethers include, but are not limited to, derivatives of 
formula C=C(OR) where R is hydrogen, alkyl, alkenyl, alkynyl, aryl, 
heteroaryl, aralkyl, heteroaralkyl, cycloalkyl or heterocyclyl. Enol esters 
include, but are not limited to, derivatives of formula C=C(OC(O)R) 
where R is hydrogen, alkyl, alkenyl, alkynyl, aryl, heteroaryl, aralkyl, 
heteroaralkyl, cycloalkyl or heterocyclyl. Solvates and hydrates are 
complexes of a compound with one or more solvent or water molecules, 
preferably 1 to about 100, more preferably 1 to about 10, most 
preferably one to about 2, 3 or 4, solvent or water molecules. 

It is to be understood that the compounds provided herein can 
contain chiral centers. Such chiral centers can be of either the (R) or (S) 
configuration, or can be a mixture thereof. Thus, the compounds 
provided herein can be enantiomerically pure, or be stereoisomeric or 
diastereomeric mixtures. In the case of amino acid residues, such 
residues can be of either the L- or D-form. The preferred configuration for 
naturally occurring sugar residues is D. 

As used herein, alkyl, alkenyl and alkynyl carbon chains, if not 
specified, contain from 1 to 20 carbons, preferably 1 to 16 carbons, and 
are straight or branched. Alkenyl carbon chains of from 2 to 20 carbons 
preferably contain 1 to 8 double bonds, and the alkenyl carbon chains of 
2 to 16 carbons preferably contain 1 to 5 double bonds. Alkynyl carbon 
chains of from 2 to 20 carbons preferably contain 1 to 8 triple bonds, and 
the alkynyl carbon chains of 2 to 16 carbons preferably contain 1 to 5 
triple bonds. Exemplary alkyl, alkenyl and alkynyl groups herein include, 
but are not limited to, methyl, ethyl, propyl, isopropyl, isobutyl, n-butyl, 
sec-butyl, tert-butyl, isopentyl, neopentyl, tert-pentyl and isohexyl. The 
alkyl, alkenyl and alkynyl groups, unless otherwise specified, can be 
optionally substituted with one or more groups, preferably alkyl group 
substituents that can be the same or different. As used herein, lower 
alkyl, lower alkenyl, and lower alkynyl refer to carbon chains having less 
than about 6 carbons. As used herein, "alk(en)(yn)yl" refers to an alkyl 
group containing at least one double bond and at least one triple bond. 
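
The default chain-length ranges above can be expressed, purely as an illustrative sketch, as a small classification helper. The function name chain_flags, the parameter lower_cutoff (a hard reading of "less than about 6 carbons") and the minimum of two carbons for unsaturated chains (inferred from the alkenyl and alkynyl ranges stated above) are assumptions made for the example, not terms of the definition.

```python
def chain_flags(carbons, unsaturated=False, lower_cutoff=6):
    """Report which default ranges a chain of the given length falls in.

    carbons      -- number of carbon atoms in the chain
    unsaturated  -- True for alkenyl/alkynyl chains (assumed minimum of 2)
    lower_cutoff -- assumed hard cutoff standing in for "about 6 carbons"
    """
    minimum = 2 if unsaturated else 1
    return {
        # 1 to 20 carbons when the chain length is not otherwise specified
        "in_default_range": minimum <= carbons <= 20,
        # 1 to 16 carbons is the stated preferred range
        "in_preferred_range": minimum <= carbons <= 16,
        # "lower" chains have fewer than about 6 carbons
        "lower": minimum <= carbons < lower_cutoff,
    }
```

For example, under these assumptions a four-carbon chain such as n-butyl satisfies all three flags, whereas an 18-carbon chain falls within the default range only.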

As used herein, an "alkyl group substituent" includes halo, 
haloalkyl, preferably halo lower alkyl, aryl, hydroxy, alkoxy, aryloxy, 
alkyloxy, alkylthio, arylthio, aralkyloxy, aralkylthio, carboxy, 
alkoxycarbonyl, oxo and cycloalkyl. 

As used herein, "aryl" refers to cyclic groups containing from 5 to 
19 carbon atoms. Aryl groups include, but are not limited to, groups 
such as fluorenyl, substituted fluorenyl, phenyl, substituted phenyl, 
naphthyl and substituted naphthyl, in which the substituent is lower alkyl, 
halogen, or lower alkoxy. 

As used herein, an "aryl group substituent" includes alkyl, cycloalkyl, 
cycloalkylalkyl, aryl, heteroaryl optionally substituted with 1 or 
more, preferably 1 to 3, substituents selected from halo, haloalkyl and 
alkyl, aralkyl, heteroaralkyl, alkenyl containing 1 to 2 double bonds, 
alkynyl containing 1 to 2 triple bonds, alk(en)(yn)yl groups, halo, 
pseudohalo, cyano, hydroxy, haloalkyl and polyhaloalkyl, preferably halo 
lower alkyl, especially trifluoromethyl, formyl, alkylcarbonyl, arylcarbonyl 
that is optionally substituted with 1 or more, preferably 1 to 3, 
substituents selected from halo, haloalkyl and alkyl, heteroarylcarbonyl, 
carboxy, alkoxycarbonyl, aryloxycarbonyl, aminocarbonyl, 
alkylaminocarbonyl, dialkylaminocarbonyl, arylaminocarbonyl, diarylaminocarbonyl, 
aralkylaminocarbonyl, alkoxy, aryloxy, perfluoroalkoxy, alkenyloxy, 
alkynyloxy, arylalkoxy, aminoalkyl, alkylaminoalkyl, dialkylaminoalkyl, 
arylaminoalkyl, amino, alkylamino, dialkylamino, arylamino, 
alkylarylamino, alkylcarbonylamino, arylcarbonylamino, azido, nitro, 
mercapto, alkylthio, arylthio, perfluoroalkylthio, thiocyano, isothiocyano, 
alkylsulfinyl, alkylsulfonyl, arylsulfinyl, arylsulfonyl, aminosulfonyl, 
alkylaminosulfonyl, dialkylaminosulfonyl and arylaminosulfonyl. 

As used herein, "aralkyl" refers to an alkyl group in which one of 
the hydrogen atoms of the alkyl is replaced by an aryl group. 

As used herein, "heteroaralkyl" refers to an alkyl group in which 
one of the hydrogen atoms of the alkyl is replaced by a heteroaryl group. 

As used herein, "cycloalkyl" refers to a saturated mono- or 
multicyclic ring system, preferably of 3 to 10 carbon atoms, more preferably 3 
to 6 carbon atoms; cycloalkenyl and cycloalkynyl refer to mono- or 
multicyclic ring systems that respectively include at least one double 
bond and at least one triple bond. Cycloalkenyl and cycloalkynyl groups 
can preferably contain 3 to 10 carbon atoms, with cycloalkenyl groups 
more preferably containing 4 to 7 carbon atoms and cycloalkynyl groups 
more preferably containing 8 to 10 carbon atoms. The ring systems of 
the cycloalkyl, cycloalkenyl and cycloalkynyl groups can be composed of 
one ring or two or more rings which can be joined together in a fused, 
bridged or spiro-connected fashion, and can be optionally substituted 
with one or more alkyl group substituents. "Cycloalk(en)(yn)yl" refers to 
a cycloalkyl group containing at least one double bond and at least one 
triple bond. 

As used herein, "heteroaryl" refers to a monocyclic or multicyclic 
ring system, preferably of about 5 to about 15 members where one or 
more, more preferably 1 to 3 of the atoms in the ring system is a 
heteroatom, that is, an element other than carbon, for example, nitrogen, 
oxygen and sulfur atoms. The heteroaryl can be optionally substituted 
with one or more, preferably 1 to 3, aryl group substituents. The 
heteroaryl group can be optionally fused to a benzene ring. Exemplary 
heteroaryl groups include, for example, furyl, imidazinyl, pyrrolidinyl, 
pyrimidinyl, tetrazolyl, thienyl, pyridyl, pyrrolyl, N-methylpyrrolyl, 
quinolinyl and isoquinolinyl, with pyridyl and quinolinyl being preferred. 

As used herein, "heterocyclic" refers to a monocyclic or multicyclic 
ring system, preferably of 3 to 10 members, more preferably 4 to 7 
members, even more preferably 5 to 6 members, where one or more, 
preferably 1 to 3 of the atoms in the ring system is a heteroatom, that is, 
an element other than carbon, for example, nitrogen, oxygen and sulfur 
atoms. The heterocycle can be optionally substituted with one or more, 
preferably 1 to 3 aryl group substituents. Preferred substituents of the 
heterocyclic group include hydroxy, amino, alkoxy containing 1 to 4 
carbon atoms, halo lower alkyl, including trihalomethyl, such as 
trifluoromethyl, and halogen. As used herein, the term heterocycle can 
include reference to heteroaryl. 

As used herein, the nomenclature alkyl, alkoxy, carbonyl, etc. is 
used as is generally understood by those of skill in this art. For example, 
as used herein alkyl refers to saturated carbon chains that contain one or 
more carbons; the chains can be straight or branched or include cyclic 
portions or be cyclic. As used herein, alicyclic refers to aryl groups that 
are cyclic. 

Where the number of any given substituent is not specified (e.g., 
"haloalkyl"), there can be one or more substituents present. For example, 
"haloalkyl" can include one or more of the same or different halogens. 
As another example, "C1-3alkoxyphenyl" can include one or more of the 
same or different alkoxy groups containing one, two or three carbons. 

As used herein, "halogen" or "halide" refers to F, Cl, Br or I. 

As used herein, pseudohalides are compounds that behave 
substantially similar to halides. Such compounds can be used in the 
same manner and treated in the same manner as halides (X-, in which X is 
a halogen, such as Cl or Br). Pseudohalides include, but are not limited 
to, cyanide, cyanate, thiocyanate, selenocyanate, trifluoromethoxy, 
trifluoromethyl and azide. 

As used herein, "haloalkyl" refers to a lower alkyl radical in which 
one or more of the hydrogen atoms are replaced by halogen including, but 
not limited to, chloromethyl, trifluoromethyl, 1-chloro-2-fluoroethyl and 
the like. 

As used herein, "haloalkoxy" refers to RO- in which R is a haloalkyl 
group. 

As used herein, "sulfinyl" or "thionyl" refers to -S(O)-. As used 
herein, "sulfonyl" or "sulfuryl" refers to -S(O)2-. As used herein, "sulfo" 
refers to -S(O)3-. 

As used herein, "carboxy" refers to a divalent radical, -C(O)O-. 

As used herein, "aminocarbonyl" refers to -C(O)NH2. 

As used herein, "alkylaminocarbonyl" refers to -C(O)NHR in which 
R is hydrogen or alkyl, preferably lower alkyl. As used herein, 
"dialkylaminocarbonyl" refers to -C(O)NR'R in which R' and R are 
independently selected from hydrogen or alkyl, preferably lower alkyl; 
"carboxamide" refers to groups of formula -NR'COR. 

As used herein, "diarylaminocarbonyl" refers to -C(O)NRR' in which 
R and R' are independently selected from aryl, preferably lower aryl, more 
preferably phenyl. 

As used herein, "aralkylaminocarbonyl" refers to -C(O)NRR' in 
which one of R and R' is aryl, preferably lower aryl, more preferably 
phenyl, and the other of R and R' is alkyl, preferably lower alkyl. 

As used herein, "arylaminocarbonyl" refers to -C(O)NHR in which R 
is aryl, preferably lower aryl, more preferably phenyl. 

As used herein, "alkoxycarbonyl" refers to -C(O)OR in which R is 
alkyl, preferably lower alkyl. 

As used herein, "aryloxycarbonyl" refers to -C(O)OR in which R is 
aryl, preferably lower aryl, more preferably phenyl. 

As used herein, "alkoxy" and "alkylthio" refer to RO- and RS-, in 
which R is alkyl, preferably lower alkyl. 

As used herein, "aryloxy" and "arylthio" refer to RO- and RS-, in 
which R is aryl, preferably lower aryl, more preferably phenyl. 

As used herein, "alkylene" refers to a straight, branched or cyclic, 
preferably straight or branched, bivalent aliphatic hydrocarbon group, 
preferably having from 1 to about 20 carbon atoms, more preferably 1 to 
12 carbons, even more preferably lower alkylene. The alkylene group is 
optionally substituted with one or more "alkyl group substituents." There 
can be optionally inserted along the alkylene group one or more oxygen, 
sulphur or substituted or unsubstituted nitrogen atoms, where the 
nitrogen substituent is alkyl as previously described. Exemplary alkylene 
groups include methylene (-CH2-), ethylene (-CH2CH2-), propylene 
(-(CH2)3-), cyclohexylene (-C6H10-), methylenedioxy (-O-CH2-O-) and 
ethylenedioxy (-O-(CH2)2-O-). The term "lower alkylene" refers to 
alkylene groups having 1 to 6 carbons. Preferred alkylene groups are 
lower alkylene, with alkylene of 1 to 3 carbon atoms being particularly 
preferred. 

As used herein, "alkenylene" refers to a straight, branched or 
cyclic, preferably straight or branched, bivalent aliphatic hydrocarbon 
group, preferably having from 2 to about 20 carbon atoms and at least 
one double bond, more preferably 1 to 12 carbons, even more preferably 
lower alkenylene. The alkenylene group is optionally substituted with one 
or more "alkyl group substituents." There can be optionally inserted 
along the alkenylene group one or more oxygen, sulphur or substituted or 
unsubstituted nitrogen atoms, where the nitrogen substituent is alkyl as 
previously described. Exemplary alkenylene groups include 
-CH=CH-CH=CH- and -CH=CH-CH2-. The term "lower alkenylene" 
refers to alkenylene groups having 2 to 6 carbons. Preferred alkenylene 
groups are lower alkenylene, with alkenylene of 3 to 4 carbon atoms 
being particularly preferred. 

As used herein, "alkynylene" refers to a straight, branched or 
cyclic, preferably straight or branched, bivalent aliphatic hydrocarbon 
group, preferably having from 2 to about 20 carbon atoms and at least 
one triple bond, more preferably 1 to 12 carbons, even more preferably 
lower alkynylene. The alkynylene group is optionally substituted with one 
or more "alkyl group substituents." There can be optionally inserted 
along the alkynylene group one or more oxygen, sulphur or substituted or 
unsubstituted nitrogen atoms, where the nitrogen substituent is alkyl as 
previously described. Exemplary alkynylene groups include 
-C≡C-C≡C-, -C≡C- and -C≡C-CH2-. The term "lower alkynylene" 
refers to alkynylene groups having 2 to 6 carbons. Preferred alkynylene 
groups are lower alkynylene, with alkynylene of 3 to 4 carbon atoms 
being particularly preferred. 

As used herein, "alk(en)(yn)ylene" refers to a straight, branched or 
cyclic, preferably straight or branched, bivalent aliphatic hydrocarbon 
group, preferably having from 2 to about 20 carbon atoms and at least 
one triple bond, and at least one double bond; more preferably 1 to 12 
carbons, even more preferably lower alk(en)(yn)ylene. The 
alk(en)(yn)ylene group is optionally substituted with one or more "alkyl 
group substituents." There can be optionally inserted along the 
alk(en)(yn)ylene group one or more oxygen, sulphur or substituted or 
unsubstituted nitrogen atoms, where the nitrogen substituent is alkyl as 
previously described. Exemplary alk(en)(yn)ylene groups include 
-C=C-(CH2)n-C≡C-, where n is 1 or 2. The term "lower 
alk(en)(yn)ylene" refers to alk(en)(yn)ylene groups having up to 6 
carbons. Preferred alk(en)(yn)ylene groups are lower alk(en)(yn)ylene, 
with alk(en)(yn)ylene of 4 carbon atoms being particularly preferred. 

As used herein, "arylene" refers to a monocyclic or polycyclic, 
preferably monocyclic, bivalent aromatic group, preferably having from 5 
to about 20 carbon atoms and at least one aromatic ring, more preferably 
5 to 12 carbons, even more preferably lower arylene. The arylene group 
is optionally substituted with one or more "alkyl group substituents." 
There can be optionally inserted around the arylene group one or more 
oxygen, sulphur or substituted or unsubstituted nitrogen atoms, where 
the nitrogen substituent is alkyl as previously described. Exemplary 
arylene groups include 1,2-, 1,3- and 1,4-phenylene. The term "lower 
arylene" refers to arylene groups having 5 or 6 carbons. Preferred 
arylene groups are lower arylene. 

As used herein, "heteroarylene" refers to a bivalent monocyclic or 
multicyclic ring system, preferably of about 5 to about 15 members 
where one or more, more preferably 1 to 3 of the atoms in the ring 
system is a heteroatom, that is, an element other than carbon, for 
example, nitrogen, oxygen and sulfur atoms. The heteroarylene group 
can be optionally substituted with one or more, preferably 1 to 3, aryl 
group substituents. 

As used herein, "alkylidene" refers to a bivalent group, such as 
=CR'R", which is attached to one atom of another group, forming a 
double bond. Exemplary alkylidene groups are methylidene (=CH2) and 
ethylidene (=CHCH3). As used herein, "aralkylidene" refers to an 
alkylidene group in which either R' or R" is an aryl group. 

As used herein, "amido" refers to the bivalent group -C(O)NH-. 
"Thioamido" refers to the bivalent group -C(S)NH-. "Oxyamido" refers to 
the bivalent group -OC(O)NH-. "Thiaamido" refers to the bivalent group 
-SC(O)NH-. "Dithiaamido" refers to the bivalent group -SC(S)NH-. 
"Ureido" refers to the bivalent group -HNC(O)NH-. "Thioureido" refers to 
the bivalent group -HNC(S)NH-. 

As used herein, "semicarbazide" refers to -NHC(O)NHNH-. 
"Carbazate" refers to the bivalent group -OC(O)NHNH-. 
"Isothiocarbazate" refers to the bivalent group -SC(O)NHNH-. 
"Thiocarbazate" refers to the bivalent group -OC(S)NHNH-. 
"Sulfonylhydrazide" refers to the group -SO2NHNH-. "Hydrazide" refers 
to the bivalent group -C(O)NHNH-. "Azo" refers to the bivalent group 
-N=N-. "Hydrazinyl" refers to the bivalent group -NH-NH-. 

As used herein, the term "amino acid" refers to α-amino acids 
which are racemic, or of either the D- or L-configuration. The designation 
"d" preceding an amino acid designation (e.g., dAla, dSer, dVal, etc.) 
refers to the D-isomer of the amino acid. The designation "dl" preceding 
an amino acid designation (e.g., dlPip) refers to a mixture of the L- and D- 
isomers of the amino acid. 

As used herein, when any particular group, such as phenyl or 
pyridyl, is specified, this means that the group is unsubstituted or is 
substituted. Preferred substituents where not specified are halo, halo 
lower alkyl, and lower alkyl. 

As used herein, a composition refers to any mixture of two or more 
products or compounds. It can be a solution, a suspension, liquid, 
powder, a paste, aqueous, non-aqueous or any combination thereof. 

As used herein, a combination refers to any association between 
two or more items. 

As used herein, fluid refers to any composition that can flow. 
Fluids thus encompass compositions that are in the form of semi-solids, 
pastes, solutions, aqueous mixtures, gels, lotions, creams and other such 
compositions. 

As used herein, substantially identical to a product means 
sufficiently similar so that the property of interest is sufficiently 
unchanged so that the substantially identical product can be used in place 
of the product. 

As used herein, the abbreviations for any protective groups, amino 
acids and other compounds, are, unless indicated otherwise, in accord 
with their common usage, recognized abbreviations, or the IUPAC-IUB 
Commission on Biochemical Nomenclature (see, Biochem. 1972, 11, 
942). 

A. Nucleoside Triphosphate Monomers 

Nucleoside triphosphate monomers are provided. The monomers 
are incorporated into oligonucleotide first components during enzymatic 
synthesis. The resulting modified oligonucleotides are readily immobilized 
or conjugated through a hydrazone or oxime linkage. The reagents 
possess a triphosphate-ribose-nucleobase motif for recognition by the 
enzymatic catalyst. 

The monomers also possess, in addition to the 
triphosphate-ribose-nucleobase motif, a protected or unprotected hydrazino, a protected or 
unprotected oxyamino, or a carbonyl moiety for formation of a hydrazone 
or oxime linkage with an appropriately modified surface or second 
component. The hydrazino moiety can be an aliphatic, aromatic or 
heteroaromatic hydrazine, semicarbazide, carbazide, hydrazide, 
thiosemicarbazide, thiocarbazide, carbonic acid dihydrazine or hydrazine 
carboxylate (see, FIGURE 2). The protecting groups are salts of the 
hydrazino or oxyamino group, or any amino or hydrazino protecting 
groups known to those of skill in the art (see, e.g., Greene et al. (1999) 
Protective Groups in Organic Synthesis (3rd Ed.) (J. Wiley & Sons, Inc.)). 
The carbonyl moiety can be any carbonyl containing group capable of 
forming a hydrazone or oxime linkage with one or more of the above 
hydrazino or oxyamino moieties. Preferred carbonyl moieties include 
aldehydes and ketones. 
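
As a non-limiting illustration, the two-part make-up of a monomer described above (the motif recognized by the enzyme plus one reactive moiety) can be sketched as a small record type. The class name Monomer, its field names and the method partner_moieties are hypothetical. Rejecting a "protected" carbonyl is a modeling choice that reflects only that the text above describes protection for the hydrazino and oxyamino moieties.

```python
from dataclasses import dataclass

# Reactive moieties named above, mapped to the moieties on the surface or
# second component with which each can form a hydrazone or oxime linkage.
PARTNERS = {
    "hydrazino": ("carbonyl",),
    "oxyamino": ("carbonyl",),
    "carbonyl": ("hydrazino", "oxyamino"),
}


@dataclass(frozen=True)
class Monomer:
    """Illustrative model of a modified nucleoside triphosphate monomer."""
    nucleobase: str          # e.g. "uracil"; not validated in this sketch
    reactive_moiety: str     # "hydrazino", "oxyamino" or "carbonyl"
    protected: bool = False  # whether the reactive moiety is protected

    def __post_init__(self):
        if self.reactive_moiety not in PARTNERS:
            raise ValueError(f"unknown reactive moiety: {self.reactive_moiety}")
        if self.protected and self.reactive_moiety == "carbonyl":
            # Assumption: protection is modeled for hydrazino/oxyamino only.
            raise ValueError("protection is modeled only for hydrazino and oxyamino")

    def partner_moieties(self):
        """Moieties the surface or second component would need to carry."""
        return PARTNERS[self.reactive_moiety]
```

Under these assumptions, a monomer bearing a protected hydrazino moiety would be paired with a carbonyl-modified surface or second component, and a carbonyl-bearing monomer with a hydrazino- or oxyamino-modified one.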

In one embodiment, the monomers for use in the methods provided 
herein have formula (I): 

P1-S1-B1-M-X 

or a derivative thereof, as defined herein, where P1 is a triphosphate 
group, as defined herein, and is preferably at the 5' position of S1; X is a 
protected or unprotected hydrazino group; S1 is a ribose, a deoxyribose 
including a 2'-deoxyribose, or a dideoxyribose including a 
2',3'-dideoxyribose; B1 is a nucleobase; and M is a divalent group having any 
combination, preferably 1-2000, more preferably 1-1000, particularly 
1-100, most preferably 1-50 or 1-25 or 1-10, of the following groups, 
which can be combined in any order: arylene, heteroarylene, 
cycloalkylene, C(R1)2, -C(R1)=C(R1)-, >C=C(R2)(R3), >C(R2)(R3), -C≡C-, 
O, S(A)a, P(D)b(R1), P(D)b(ER1), N(R1), >N+(R2)(R3) and C(E); where a is 0, 
1 or 2; b is 0, 1, 2 or 3; A is O or NR1; D is S or O; and E is S, O or NR1; 
each R1 is a monovalent group independently selected from hydrogen and 
M1-R4; each M1 is a divalent group independently having any combination 
of the following groups, which groups can be combined in any order: a 
direct link, arylene, heteroarylene, cycloalkylene, C(R5)2, -C(R5)=C(R5)-, 
>C=C(R2)(R3), >C(R2)(R3), -C≡C-, O, S(A)a, P(D)b(R5), P(D)b(ER5), N(R5), 
N(COR5), >N+(R2)(R3) and C(E); where a is 0, 1 or 2; b is 0, 1, 2 or 3; A 
is O or NR5; D is S or O; and E is S, O or NR5; R4 and R5 are each 
independently selected from among hydrogen, halo, pseudohalo, cyano, 
azido, nitro, SiR6R7R8, alkyl, alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, 
aralkyl, aralkenyl, aralkynyl, heteroaryl, heteroaralkyl, heteroaralkenyl, 
heteroaralkynyl, heterocyclyl, heterocyclylalkyl, heterocyclylalkenyl, 
heterocyclylalkynyl, hydroxy, alkoxy, aryloxy, aralkoxy, heteroaralkoxy 
and NR9R10; R9 and R10 are each independently selected from hydrogen, 
alkyl, alkenyl, alkynyl, cycloalkyl, aryl, aralkyl, heteroaryl, heteroaralkyl 
and heterocyclyl; R2 and R3 are selected from (i) or (ii) as follows: (i) R2 
and R3 are independently selected from among hydrogen, alkyl, alkenyl, 
alkynyl, cycloalkyl, aryl and heteroaryl; or (ii) R2 and R3 together form 
alkylene, alkenylene or cycloalkylene; R6, R7 and R8 are each 
independently a monovalent group selected from hydrogen, alkyl, alkenyl, 
alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl, aralkenyl, aralkynyl, 
heteroaryl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl, heterocyclyl, 
heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, hydroxy, 
alkoxy, aryloxy, aralkoxy, heteroaralkoxy and NR9R10; and 

R1, R2, R3, R4, R5, R6, R7, R8, R9 and R10 can be substituted with 
one or more substituents each independently selected from Z, wherein Z 
is selected from alkyl, alkenyl, alkynyl, aryl, cycloalkyl, cycloalkenyl, 
hydroxy, S(O)hR20, NR20R21, COOR20, COR20, CONR20R21, OC(O)NR20R21, 
N(R20)C(O)R21, alkoxy, aryloxy, heteroaryl, heterocyclyl, heteroaryloxy, 
heterocyclyloxy, aralkyl, aralkenyl, aralkynyl, heteroaralkyl, 
heteroaralkenyl, heteroaralkynyl, aralkoxy, heteroaralkoxy, alkoxycarbonyl, 
carbamoyl, thiocarbamoyl, alkoxycarbonyl, carboxyaryl, halo, pseudohalo, 
haloalkyl and carboxamido; h is 0, 1 or 2; and R20 and R21 are each 
independently selected from among hydrogen, halo, pseudohalo, cyano, 
azido, nitro, trialkylsilyl, dialkylarylsilyl, alkyldiarylsilyl, triarylsilyl, alkyl, 
alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl, aralkenyl, aralkynyl, 
heteroaryl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl, heterocyclyl, 
heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, hydroxy, 
alkoxy, aryloxy, aralkoxy, heteroaralkoxy, amino, amido, alkylamino, 
dialkylamino, alkylarylamino, diarylamino and arylamino. 

In another embodiment, the monomers for use in the methods 
provided herein have formula (I): 

P1-S1-B1-M-X 

or a derivative thereof, as defined herein, where P1 is a triphosphate 
group, as defined herein, and is preferably at the 5' position of S1; X is a 
protected or unprotected oxyamino group; S1 is a ribose, a deoxyribose 
including a 2'-deoxyribose, or a dideoxyribose including a 
2',3'-dideoxyribose; B1 is a nucleobase; and M is a divalent group having any 
combination, preferably 1-2000, more preferably 1-1000, particularly 
1-100, most preferably 1-50 or 1-25 or 1-10, of the following groups, 
which can be combined in any order: arylene, heteroarylene, 
cycloalkylene, C(R1)2, -C(R1)=C(R1)-, >C=C(R2)(R3), >C(R2)(R3), -C≡C-, 
O, S(A)a, P(D)b(R1), P(D)b(ER1), N(R1), >N+(R2)(R3) and C(E); where a is 0, 
1 or 2; b is 0, 1, 2 or 3; A is O or NR1; D is S or O; and E is S, O or NR1; 
each R1 is a monovalent group independently selected from hydrogen and 
M1-R4; each M1 is a divalent group independently having any combination 
of the following groups, which groups can be combined in any order: a 
direct link, arylene, heteroarylene, cycloalkylene, C(R5)2, -C(R5)=C(R5)-, 
>C=C(R2)(R3), >C(R2)(R3), -C≡C-, O, S(A)a, P(D)b(R5), P(D)b(ER5), N(R5), 
N(COR5), >N+(R2)(R3) and C(E); where a is 0, 1 or 2; b is 0, 1, 2 or 3; A 
is O or NR5; D is S or O; and E is S, O or NR5; R4 and R5 are each 
independently selected from among hydrogen, halo, pseudohalo, cyano, 
azido, nitro, SiR6R7R8, alkyl, alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, 
aralkyl, aralkenyl, aralkynyl, heteroaryl, heteroaralkyl, heteroaralkenyl, 
heteroaralkynyl, heterocyclyl, heterocyclylalkyl, heterocyclylalkenyl, 
heterocyclylalkynyl, hydroxy, alkoxy, aryloxy, aralkoxy, heteroaralkoxy 
and NR9R10; R9 and R10 are each independently selected from hydrogen, 
alkyl, alkenyl, alkynyl, cycloalkyl, aryl, aralkyl, heteroaryl, heteroaralkyl 
and heterocyclyl; R2 and R3 are selected from (i) or (ii) as follows: (i) R2 
and R3 are independently selected from among hydrogen, alkyl, alkenyl, 
alkynyl, cycloalkyl, aryl and heteroaryl; or (ii) R2 and R3 together form 
alkylene, alkenylene or cycloalkylene; R6, R7 and R8 are each 
independently a monovalent group selected from hydrogen, alkyl, alkenyl, 
alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl, aralkenyl, aralkynyl, 
heteroaryl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl, heterocyclyl, 
heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, hydroxy, 
alkoxy, aryloxy, aralkoxy, heteroaralkoxy and NR9R10; and 

R1, R2, R3, R4, R5, R6, R7, R8, R9 and R10 can be substituted with 
one or more substituents each independently selected from Z, wherein Z 
is selected from alkyl, alkenyl, alkynyl, aryl, cycloalkyl, cycloalkenyl, 
hydroxy, S(O)hR20, NR20R21, COOR20, COR20, CONR20R21, OC(O)NR20R21, 
N(R20)C(O)R21, alkoxy, aryloxy, heteroaryl, heterocyclyl, heteroaryloxy, 
heterocyclyloxy, aralkyl, aralkenyl, aralkynyl, heteroaralkyl, 
heteroaralkenyl, heteroaralkynyl, aralkoxy, heteroaralkoxy, alkoxycarbonyl, 
carbamoyl, thiocarbamoyl, alkoxycarbonyl, carboxyaryl, halo, pseudohalo, 
haloalkyl and carboxamido; h is 0, 1 or 2; and R20 and R21 are each 
independently selected from among hydrogen, halo, pseudohalo, cyano, 
azido, nitro, trialkylsilyl, dialkylarylsilyl, alkyldiarylsilyl, triarylsilyl, alkyl, 
alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl, aralkenyl, aralkynyl, 
heteroaryl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl, heterocyclyl, 
heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, hydroxy, 
alkoxy, aryloxy, aralkoxy, heteroaralkoxy, amino, amido, alkylamino, 
dialkylamino, alkylarylamino, diarylamino and arylamino. 
In another embodiment, the monomers for use in the methods
provided herein have formula (I):

P 1 -S 1 -B 1 -M-X

or a derivative thereof, as defined herein, where P 1 is a triphosphate
group, as defined herein, and is preferably at the 5' position of S 1 ; X is a
carbonyl group; S 1 is a ribose, a deoxyribose including 2'-deoxyribose, or
a dideoxyribose including 2',3'-dideoxyribose; B 1 is a nucleobase; and M
is a divalent group having any combination, preferably 1-2000, more
preferably 1-1000, particularly 1-100, most preferably 1-50 or 1-25 or
1-10, of the following groups, which can be combined in any order:

arylene, heteroarylene, cycloalkylene, C(R 1 ) 2 , -C(R 1 )=C(R 1 )-,
>C=C(R 2 )(R 3 ), >C(R 2 )(R 3 ), -C≡C-, O, S(A) a , P(D) b (R 1 ), P(D) b (ER 1 ), N(R 1 ),
>N + (R 2 )(R 3 ) and C(E); where a is 0, 1 or 2; b is 0, 1, 2 or 3; A is O or
NR 1 ; D is S or O; and E is S, O or NR 1 ; each R 1 is a monovalent group
independently selected from hydrogen and M 1 -R 4 ; each M 1 is a divalent
group independently having any combination of the following groups,
which groups can be combined in any order: a direct link, arylene,
heteroarylene, cycloalkylene, C(R 5 ) 2 , -C(R 5 )=C(R 5 )-, >C=C(R 2 )(R 3 ),
>C(R 2 )(R 3 ), -C≡C-, O, S(A) a , P(D) b (R 5 ), P(D) b (ER 5 ), N(R 5 ), N(COR 5 ),
>N + (R 2 )(R 3 ) and C(E); where a is 0, 1 or 2; b is 0, 1, 2 or 3; A is O or
NR 5 ; D is S or O; and E is S, O or NR 5 ; R 4 and R 5 are each independently
selected from among hydrogen, halo, pseudohalo, cyano, azido, nitro,
SiR 6 R 7 R 8 , alkyl, alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl,
aralkenyl, aralkynyl, heteroaryl, heteroaralkyl, heteroaralkenyl,
heteroaralkynyl, heterocyclyl, heterocyclylalkyl, heterocyclylalkenyl,
heterocyclylalkynyl, hydroxy, alkoxy, aryloxy, aralkoxy, heteroaralkoxy
and NR 9 R 10 ; R 9 and R 10 are each independently selected from hydrogen,
alkyl, alkenyl, alkynyl, cycloalkyl, aryl, aralkyl, heteroaryl, heteroaralkyl
and heterocyclyl; R 2 and R 3 are selected from (i) or (ii) as follows: (i) R 2 
and R 3 are independently selected from among hydrogen, alkyl, alkenyl, 
alkynyl, cycloalkyl, aryl and heteroaryl; or (ii) R 2 and R 3 together form 
alkylene, alkenylene or cycloalkylene; R 6 , R 7 and R 8 are each
independently a monovalent group selected from hydrogen, alkyl, alkenyl,
alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl, aralkenyl, aralkynyl,
heteroaryl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl, heterocyclyl,
heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, hydroxy,
alkoxy, aryloxy, aralkoxy, heteroaralkoxy and NR 9 R 10 ; and

R 1 , R 2 , R 3 , R 4 , R 5 , R 6 , R 7 , R 8 , R 9 and R 10 can be substituted with
one or more substituents each independently selected from Z, wherein Z
is selected from alkyl, alkenyl, alkynyl, aryl, cycloalkyl, cycloalkenyl,
hydroxy, S(O) h R 20 , NR 20 R 21 , COOR 20 , COR 20 , CONR 20 R 21 , OC(O)NR 20 R 21 ,
N(R 20 )C(O)R 21 , alkoxy, aryloxy, heteroaryl, heterocyclyl, heteroaryloxy,
heterocyclyloxy, aralkyl, aralkenyl, aralkynyl, heteroaralkyl,
heteroaralkenyl, heteroaralkynyl, aralkoxy, heteroaralkoxy, alkoxycarbonyl,
carbamoyl, thiocarbamoyl, alkoxycarbonyl, carboxyaryl, halo, pseudohalo,
haloalkyl and carboxamido; h is 0, 1 or 2; and R 20 and R 21 are each
independently selected from among hydrogen, halo, pseudohalo, cyano,
azido, nitro, trialkylsilyl, dialkylarylsilyl, alkyldiarylsilyl, triarylsilyl, alkyl,
alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl, aralkenyl, aralkynyl,
heteroaryl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl, heterocyclyl,
heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, hydroxy,
alkoxy, aryloxy, aralkoxy, heteroaralkoxy, amino, amido, alkylamino,
dialkylamino, alkylarylamino, diarylamino and arylamino.

In certain embodiments, the reagents for use in the methods 
provided herein are triphosphates that have formula (I): 
P 1 -S 1 -B 1 -M-X

or a derivative thereof, wherein S 1 is ribose, 2'-deoxyribose, or
2',3'-dideoxyribose; X is a protected or unprotected hydrazino, protected or
unprotected oxyamino, or a carbonyl group; B 1 is a cytosine, uracil,
adenine, guanine or thymine, or an analog thereof, including, but not
limited to, 7-deazaadenine, 7-deazaguanine and 7-deazahypoxanthine;
and P 1 and M are selected as above.

M preferably has 1-2000, more preferably 1-1000, particularly
1-100, most preferably 1-50 or 1-25 or 1-10 of the groups as defined
above. In certain embodiments, M has 1-50, preferably 1-25, more
preferably 1-10 of the following groups, which can be combined in any
order: arylene, heteroarylene, C(R 1 ) 2 , -C(R 1 )=C(R 1 )-, -C≡C-, O, S(A) a ,
N(R 1 ), N(COR 1 ) and C(E); where a is 0, 1 or 2; A is O or NR 1 ; and E is S,
O or NR 1 ; where R 1 is as defined above. In more preferred embodiments,
M has 1-10 of the following groups, which can be combined in any order:
heteroarylene, C(R 1 ) 2 , -C≡C-, N(R 1 ) and C(E); where E is S, O or NR 1 ;
where R 1 is as defined above. In other embodiments, M has 1-10 of the
following groups, which can be combined in any order: arylene, C(R 1 ) 2 ,
-C≡C-, N(R 1 ) and C(E); where E is S, O or NR 1 ; where R 1 is as defined
above. R 1 is preferably hydrogen; and E is preferably oxygen. In further
embodiments, M has 1-10 of the following groups, which can be
combined in any order: C(R 1 ) 2 , -C≡C-, N(R 1 ) and C(E); where E is S, O or
NR 1 ; where R 1 is as defined above. In these embodiments, the
heteroarylene groups of M are preferably monocyclic or fused bicyclic 
divalent groups, more preferably monocyclic divalent groups, including, 
but not limited to, thienylene, furylene, pyrrolylene including 2,3-, 2,4- 
and 2,5-pyrrolylene, pyridylene including 2,3-, 2,4-, 2,5- and 2,6- 
pyridylene, triazinylene including 1,3,5-triazinylene, triazolylene including 
1,2,4- and 1,2,3-triazolylene or pyrimidylene including 2,4-, 2,5- and 2,6- 
pyrimidylene groups. The arylene groups of M are preferably divalent 
monocyclic, bicyclic or tricyclic groups, including, but not limited to, 1,2-, 
1,3- and 1,4-phenylene, and naphthylene.
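
As an informal illustration of the counting constraint on M described above, the following Python sketch represents a linker as an ordered list of group names and checks it against one of the more preferred embodiments (1-10 groups drawn from heteroarylene, C(R 1 ) 2 , -C≡C-, N(R 1 ) and C(E)). The token names and the function `is_preferred_linker` are hypothetical conveniences introduced only for this sketch and are not part of the disclosure.

```python
# Hypothetical token names for the groups of one preferred embodiment of M.
# "C#C" stands for the -C≡C- group; these labels are assumptions for illustration.
PREFERRED_GROUPS = {"heteroarylene", "C(R1)2", "C#C", "N(R1)", "C(E)"}


def is_preferred_linker(groups, min_groups=1, max_groups=10):
    """Return True if the linker has 1-10 groups, all drawn from the allowed set.

    Order is deliberately not checked, because the groups can be
    combined in any order.
    """
    if not (min_groups <= len(groups) <= max_groups):
        return False
    return all(group in PREFERRED_GROUPS for group in groups)


# Example linker written with the hypothetical tokens.
example = ["heteroarylene", "C(E)", "N(R1)", "C(R1)2", "C#C"]
print(is_preferred_linker(example))
```

The same check could be adapted to the other embodiments by changing the allowed set and the upper bound.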

In other embodiments, M is a chain of repeating monomer units, 
preferably 1-2000 monomer units, more preferably 1-1000, particularly
1-100, most preferably 1-50 or 1-25 or 1-10, including, but not limited to,
ethylene oxide, propylene oxide, methacrylamide, or ethylene glycol.
Thus, in certain embodiments, M is a polyethylene glycol (PEG),
polypropylene glycol (PPG) or acrylate chain. In particular embodiments, M is
a diethyleneglycol, triethyleneglycol or tetraethyleneglycol moiety. 

In other embodiments, M contains a cleavable linkage, including, 
but not limited to, a disulfide, an ester, an enzyme specific peptide, a 
photocleavable linkage, such as a nitroaromatic group, or an acid labile
group. Monomers of formula (I) and modified oligonucleotides of formula
(II) containing such M groups have applications in capture assay methods
where the analyte is captured by the modified oligonucleotide, the
conjugate isolated, and the M group cleaved to provide an isolated
analyte. See, e.g., Niemeyer et al. (1994) Nucl. Acids Res. 22(25):5530-5539.

X is preferably -C(O)R 30 , -Y-N(R 31 )-Y 1 -N(R 32 )-Y 2 or -O-N(R 30 )-Y 2 ;
where R 30 , R 31 and R 32 are each independently hydrogen, alkyl, alkenyl,
alkynyl, aryl, heteroaryl, aralkyl, aralkenyl, aralkynyl, heteroaralkyl,
heteroaralkenyl, heteroaralkynyl, heterocyclyl or cycloalkyl; Y and Y 1 are
selected as in (i) or (ii) as follows:

(i) Y is a direct link, and Y 1 is a direct link, C(O)N(R 35 ),
N(R 35 )C(O)N(R 36 ), C(S)N(R 35 ), N(R 35 )C(S)N(R 36 ) or
C(O)N(R 35 )N(R 36 )C(O)N(R 37 ); or

(ii) Y is C(O) or OC(O), and Y 1 is a direct link;

where R 35 , R 36 and R 37 are each independently selected from
among hydrogen, alkyl, alkenyl, alkynyl, aryl, heteroaryl, aralkyl,
aralkenyl, aralkynyl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl,
heterocyclyl and cycloalkyl; and

Y 2 is a salt of the hydrazino or oxyamino group, including but not
limited to, mineral acid salts, such as but not limited to hydrochlorides
and sulfates, and salts of organic acids, such as but not limited to
acetates, lactates, malates, tartrates, citrates, ascorbates, succinates,
butyrates, valerates and fumarates, or any amino or hydrazino protecting
group (see, e.g., Greene et al. (1999) Protective Groups in Organic
Synthesis (3rd Ed.) (J. Wiley & Sons, Inc.));

where R 30 , R 31 , R 32 , R 35 , R 36 , R 37 and Y 2 are unsubstituted or
substituted with one or more substituents each independently selected
from Z, wherein Z is selected from alkyl, alkenyl, alkynyl, aryl, cycloalkyl,
cycloalkenyl, hydroxy, S(O) h R 20 , NR 20 R 21 , COOR 20 , COR 20 , CONR 20 R 21 ,
OC(O)NR 20 R 21 , N(R 20 )C(O)R 21 , alkoxy, aryloxy, heteroaryl, heterocyclyl,
heteroaryloxy, heterocyclyloxy, aralkyl, aralkenyl, aralkynyl, heteroaralkyl,
heteroaralkenyl, heteroaralkynyl, aralkoxy, heteroaralkoxy,
alkoxycarbonyl, carbamoyl, thiocarbamoyl, alkoxycarbonyl, carboxyaryl,
halo, pseudohalo, haloalkyl and carboxamido; h is 0, 1 or 2; and R 20 and
R 21 are each independently selected from among hydrogen, halo,
pseudohalo, cyano, azido, nitro, trialkylsilyl, dialkylarylsilyl, alkyldiarylsilyl,
triarylsilyl, alkyl, alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl,
aralkenyl, aralkynyl, heteroaryl, heteroaralkyl, heteroaralkenyl,
heteroaralkynyl, heterocyclyl, heterocyclylalkyl, heterocyclylalkenyl,
heterocyclylalkynyl, hydroxy, alkoxy, aryloxy, aralkoxy, heteroaralkoxy,
amino, amido, alkylamino, dialkylamino, alkylarylamino, diarylamino and
arylamino.

In certain embodiments herein, Y 2 is selected from any amino or 
hydrazino protecting group, preferably monomethoxytrityl (MMT), 9- 
fluorenylmethoxycarbonyl (FMOC), acetyl, benzoyl, or a hydrazone or 
oxime that is cleaved under mild acidic conditions (e.g., 100 mM acetate,
pH 4.5-5.5) including, but not limited to, a hydrazone or oxime formed
from a lower aliphatic aldehyde or ketone, preferably from acetone, 
cyclohexanone, propanal or 2-butanone. In other embodiments, 
particularly when X is a protected oxyamino group, the protecting group 
is a phthalimidyl group. 
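
As a rough worked illustration of the mild acidic conditions mentioned above (100 mM acetate, pH 4.5-5.5), the following Python sketch uses the Henderson-Hasselbalch relation to estimate how the total acetate divides between acetic acid and acetate ion. It assumes a pKa of 4.76 for acetic acid (a commonly quoted value near 25 °C) and ideal solution behaviour; the function name `acetate_buffer_composition` is hypothetical, and none of this is taken from the disclosure itself.

```python
ACETIC_ACID_PKA = 4.76  # assumed literature value near 25 °C, not from the disclosure


def acetate_buffer_composition(ph, total_mm=100.0, pka=ACETIC_ACID_PKA):
    """Return (acetic acid mM, acetate mM) for a given pH and total concentration.

    Uses pH = pKa + log10([acetate] / [acetic acid]) with no activity corrections.
    """
    ratio = 10 ** (ph - pka)  # [acetate] / [acetic acid]
    acid = total_mm / (1.0 + ratio)
    base = total_mm - acid
    return acid, base


for ph in (4.5, 5.0, 5.5):
    acid, base = acetate_buffer_composition(ph)
    print(f"pH {ph}: acetic acid {acid:.1f} mM, acetate {base:.1f} mM")
```

Under these assumptions the buffer shifts from a slight excess of acetic acid at the lower end of the range to a clear excess of acetate ion at the upper end.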

The monomers provided herein are added to the growing
oligonucleotide chain during enzymatic synthesis. In the case of terminal
modification, wherein purification of the final product using a lipophilic
handle is required, the amino or hydrazino protecting group is MMT, or a
similar moiety. The final protected hydrazino or protected oxyamino
substituted product is deprotected following high performance liquid
chromatography (HPLC) purification.

1. Hydrazino Derivatives

In one preferred embodiment, the monomers for use in the 
methods provided herein contain a triphosphate group, as defined herein, 
a sugar group, a nucleobase, and a hydrazino group (see, FIGURE 2). 
Thus, in this embodiment, the compounds of formula (I) have formula 
(III): 

P 1 -S 1 -B 1 -M-Y-N(R 31 )-Y 1 -N(R 32 )-Y 2 ,

or a derivative thereof, where P 1 is a triphosphate group, preferably 
located at the 5' position of S 1 ; 

M is a divalent group having any combination, preferably 1-2000, 
more preferably 1-1000, particularly 1-100, most preferably 1-50 or 1-25 
or 1-10, of the following groups, which can be combined in any order: 
arylene, heteroarylene, cycloalkylene, C(R 1 ) 2 , -C(R 1 )=C(R 1 )-,
>C=C(R 2 )(R 3 ), >C(R 2 )(R 3 ), -C≡C-, O, S(A) a , P(D) b (R 1 ), P(D) b (ER 1 ), N(R 1 ),
>N + (R 2 )(R 3 ) and C(E); where a is 0, 1 or 2; b is 0, 1, 2 or 3; A is O or
NR 1 ; D is S or O; and E is S, O or NR 1 ; each R 1 is a monovalent group
independently selected from hydrogen and M 1 -R 4 ; each M 1 is a divalent
group independently having any combination of the following groups,
which groups can be combined in any order: a direct link, arylene,
heteroarylene, cycloalkylene, C(R 5 ) 2 , -C(R 5 )=C(R 5 )-, >C=C(R 2 )(R 3 ),
>C(R 2 )(R 3 ), -C≡C-, O, S(A) a , P(D) b (R 5 ), P(D) b (ER 5 ), N(R 5 ), N(COR 5 ),
>N + (R 2 )(R 3 ) and C(E); where a is 0, 1 or 2; b is 0, 1, 2 or 3; A is O or
NR 5 ; D is S or O; and E is S, O or NR 5 ; R 4 and R 5 are each independently
selected from among hydrogen, halo, pseudohalo, cyano, azido, nitro,
SiR 6 R 7 R 8 , alkyl, alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl,
aralkenyl, aralkynyl, heteroaryl, heteroaralkyl, heteroaralkenyl,
heteroaralkynyl, heterocyclyl, heterocyclylalkyl, heterocyclylalkenyl,
heterocyclylalkynyl, hydroxy, alkoxy, aryloxy, aralkoxy, heteroaralkoxy
and NR 9 R 10 ; R 9 and R 10 are each independently selected from hydrogen,
alkyl, alkenyl, alkynyl, cycloalkyl, aryl, aralkyl, heteroaryl, heteroaralkyl
and heterocyclyl; R 2 and R 3 are selected from (i) or (ii) as follows: (i) R 2
and R 3 are independently selected from among hydrogen, alkyl, alkenyl,
alkynyl, cycloalkyl, aryl and heteroaryl; or (ii) R 2 and R 3 together form
alkylene, alkenylene or cycloalkylene; R 6 , R 7 and R 8 are each
independently a monovalent group selected from hydrogen, alkyl, alkenyl,
alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl, aralkenyl, aralkynyl,
heteroaryl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl, heterocyclyl,
heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, hydroxy,
alkoxy, aryloxy, aralkoxy, heteroaralkoxy and NR 9 R 10 ;
Y and Y 1 are selected as in (i) or (ii) as follows: 

(i) Y is a direct link, and Y 1 is a direct link, C(O)N(R 35 ),
N(R 35 )C(O)N(R 36 ), C(S)N(R 35 ), N(R 35 )C(S)N(R 36 ) or
C(O)N(R 35 )N(R 36 )C(O)N(R 37 ); or

(ii) Y is C(O) or OC(O), and Y 1 is a direct link;

where R 35 , R 36 and R 37 are each independently selected from
among hydrogen, alkyl, alkenyl, alkynyl, aryl, heteroaryl, aralkyl,
aralkenyl, aralkynyl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl,
heterocyclyl and cycloalkyl;

Y 2 is a salt of the hydrazino group, including but not limited to,
mineral acid salts, such as but not limited to hydrochlorides and sulfates,
and salts of organic acids, such as but not limited to acetates, lactates,
malates, tartrates, citrates, ascorbates, succinates, butyrates, valerates
and fumarates, or any amino or hydrazino protecting group, preferably an
amino or hydrazino protecting group useful in the synthesis of
oligonucleotides, more preferably monomethoxytrityl (MMT),
dimethoxytrityl (DMT), 9-fluorenylmethoxycarbonyl (FMOC), acetyl,
trifluoroacetyl (TFA), benzoyl, or a hydrazone that is cleaved under mild
acidic conditions (e.g., 100 mM acetate, pH 4.5-5.5) including, but not
limited to, a hydrazone formed from a lower aliphatic aldehyde or ketone,
preferably from acetone, cyclohexanone, propanal or 2-butanone; and

R 31 and R 32 are each independently hydrogen, alkyl, alkenyl,
alkynyl, aryl, heteroaryl, aralkyl, aralkenyl, aralkynyl, heteroaralkyl, 
heteroaralkenyl, heteroaralkynyl, heterocyclyl or cycloalkyl; 

where R 1 , R 2 , R 3 , R 4 , R 5 , R 6 , R 7 , R 8 , R 9 , R 10 , R 31 , R 32 , R 35 , R 36 , R 37
and Y 2 are unsubstituted or substituted with one or more substituents 
each independently selected from Z, as defined above. 

In preferred embodiments, R 31 , R 32 , R 35 , R 36 and R 37 are hydrogen 
or lower alkyl, more preferably hydrogen. In these embodiments, the 
compounds of formula (III) have formula (IV): 
P 1 -S 1 -B 1 -M-Y-NH-Y 1 -NH-Y 2 ,

or a derivative thereof, where Y and Y 1 are selected from (i) or (ii) as 
follows: 

(i) Y is a direct link, and Y 1 is a direct link, C(O)NH, NHC(O)NH,
C(S)NH, NHC(S)NH or C(O)NHNHC(O)NH; or

(ii) Y is C(O) or OC(O), and Y 1 is a direct link;

and P 1 , S 1 , B 1 , M and Y 2 are selected as above.
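
The pairing rule for Y and Y 1 in formula (IV) can be restated as a small lookup. The following Python sketch is only an illustrative aid: the string labels and the function `is_valid_y_pair` are hypothetical names chosen for this example, while the allowed values are transcribed from options (i) and (ii) above.

```python
# Allowed Y1 values when Y is a direct link, per option (i) of formula (IV).
Y1_WHEN_Y_DIRECT = {
    "direct link",
    "C(O)NH",
    "NHC(O)NH",
    "C(S)NH",
    "NHC(S)NH",
    "C(O)NHNHC(O)NH",
}

# Allowed Y values under option (ii), where Y1 must be a direct link.
Y_WHEN_Y1_DIRECT = {"C(O)", "OC(O)"}


def is_valid_y_pair(y, y1):
    """Return True if (Y, Y1) satisfies option (i) or option (ii)."""
    if y == "direct link":
        return y1 in Y1_WHEN_Y_DIRECT
    if y in Y_WHEN_Y1_DIRECT:
        return y1 == "direct link"
    return False


print(is_valid_y_pair("direct link", "C(O)NH"))
print(is_valid_y_pair("C(O)", "C(O)NH"))
```

In this sketch, a hydrazide-type linkage corresponds to Y being C(O) with Y 1 a direct link, whereas a semicarbazide-type linkage corresponds to Y being a direct link with Y 1 being NHC(O)NH.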

In a preferred embodiment, the compounds for use in the methods 
provided herein are of formula (IV) where M has 1-50, preferably 1-25, 
more preferably 1-10 of the following groups, which can be combined in 
any order: arylene, heteroarylene, C(R 1 ) 2 , -C(R 1 )=C(R 1 )-, -C≡C-, O,
S(A) a , N(R 1 ), N(COR 1 ) and C(E); where a is 0, 1 or 2; A is O or NR 1 ; and E
is S, O or NR 1 ; where R 1 is selected as above. In more preferred
embodiments, M has 1-10 of the following groups, which can be
combined in any order: heteroarylene, C(R 1 ) 2 , -C≡C-, N(R 1 ) and C(E);
where E is S, O or NR 1 ; where R 1 is selected as above. R 1 is preferably 
hydrogen; and E is preferably oxygen. 

In certain embodiments herein, the group attached to Y is a 
heteroarylene group, preferably a nitrogen-containing heteroarylene group,
preferably a pyrrolylene including 2,3-, 2,4- and 2,5-pyrrolylene,
pyridylene including 2,3-, 2,4-, 2,5- and 2,6-pyridylene, triazinylene
including 1,3,5-triazinylene, triazolylene including 1,2,4- and 1,2,3-
triazolylene or pyrimidylene including 2,4-, 2,5- and 2,6-pyrimidylene
group, more preferably a pyridylene group, most preferably a
2,5-pyridylene group. Thus, in these more preferred embodiments, M
includes 2,5-pyridylene attached to Y, preferably Y is at the 2-position of
the pyridylene group, and 1-9 of the following groups, which can be
combined in any order: CH 2 , -C≡C-, NH and C(O).
In more preferred embodiments, M has the formula:

[Structural formula not reproduced.]

2. Carbonyl Derivatives

In another preferred embodiment, the monomers for use in the 
methods provided herein contain a triphosphate group, as defined herein, 
a sugar group, a nucleobase, and a carbonyl group. Thus, in this 
embodiment, the compounds of formula (I) have formula (V): 

P 1 -S 1 -B 1 -M-C(O)R 30 ,

or a derivative thereof, where P 1 is a triphosphate group, preferably
located at the 5' position of S 1 ;

M is a divalent group having any combination, preferably 1-2000, 
more preferably 1-1000, particularly 1-100, most preferably 1-50 or 1-25 

or 1-10, of the following groups, which can be combined in any order:
arylene, heteroarylene, cycloalkylene, C(R 1 ) 2 , -C(R 1 )=C(R 1 )-,
>C=C(R 2 )(R 3 ), >C(R 2 )(R 3 ), -C≡C-, O, S(A) a , P(D) b (R 1 ), P(D) b (ER 1 ), N(R 1 ),
>N + (R 2 )(R 3 ) and C(E); where a is 0, 1 or 2; b is 0, 1, 2 or 3; A is O or
NR 1 ; D is S or O; and E is S, O or NR 1 ; each R 1 is a monovalent group
independently selected from hydrogen and M 1 -R 4 ; each M 1 is a divalent
group independently having any combination of the following groups,
which groups can be combined in any order: a direct link, arylene,
heteroarylene, cycloalkylene, C(R 5 ) 2 , -C(R 5 )=C(R 5 )-, >C=C(R 2 )(R 3 ),
>C(R 2 )(R 3 ), -C≡C-, O, S(A) a , P(D) b (R 5 ), P(D) b (ER 5 ), N(R 5 ), N(COR 5 ),
>N + (R 2 )(R 3 ) and C(E); where a is 0, 1 or 2; b is 0, 1, 2 or 3; A is O or
NR 5 ; D is S or O; and E is S, O or NR 5 ; R 4 and R 5 are each independently
selected from among hydrogen, halo, pseudohalo, cyano, azido, nitro,
SiR 6 R 7 R 8 , alkyl, alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl,
aralkenyl, aralkynyl, heteroaryl, heteroaralkyl, heteroaralkenyl,
heteroaralkynyl, heterocyclyl, heterocyclylalkyl, heterocyclylalkenyl,
heterocyclylalkynyl, hydroxy, alkoxy, aryloxy, aralkoxy, heteroaralkoxy
and NR 9 R 10 ; R 9 and R 10 are each independently selected from hydrogen,
alkyl, alkenyl, alkynyl, cycloalkyl, aryl, aralkyl, heteroaryl, heteroaralkyl
and heterocyclyl; R 2 and R 3 are selected from (i) or (ii) as follows: (i) R 2
and R 3 are independently selected from among hydrogen, alkyl, alkenyl,
alkynyl, cycloalkyl, aryl and heteroaryl; or (ii) R 2 and R 3 together form
alkylene, alkenylene or cycloalkylene; R 6 , R 7 and R 8 are each
independently a monovalent group selected from hydrogen, alkyl, alkenyl,
alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl, aralkenyl, aralkynyl,
heteroaryl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl, heterocyclyl,
heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, hydroxy,
alkoxy, aryloxy, aralkoxy, heteroaralkoxy and NR 9 R 10 ;

R 30 is hydrogen, alkyl, alkenyl, alkynyl, aryl, heteroaryl, aralkyl,
aralkenyl, aralkynyl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl,
heterocyclyl or cycloalkyl;

where R 1 , R 2 , R 3 , R 4 , R 5 , R 6 , R 7 , R 8 , R 9 , R 10 and R 30 are
unsubstituted or substituted with one or more substituents each
independently selected from Z, as defined above.

In preferred embodiments, R 30 is hydrogen or lower alkyl, more
preferably hydrogen or methyl, most preferably hydrogen. In these
embodiments, the compounds of formula (V) have formulae (VI):

P 1 -S 1 -B 1 -M-CHO or P 1 -S 1 -B 1 -M-C(O)Me,

or a derivative thereof, where P 1 , S 1 , B 1 and M are selected as above.

In a preferred embodiment, the compounds for use in the methods 
provided herein are of formula (VI) where M has 1-50, preferably 1-25, 
more preferably 1-10 of the following groups, which can be combined in 
any order: arylene, heteroarylene, C(R 1 ) 2 , -C(R 1 )=C(R 1 )-, -C≡C-, O,
S(A) a , N(R 1 ), N(COR 1 ) and C(E); where a is 0, 1 or 2; A is O or NR 1 ; and E
is S, O or NR 1 ; where R 1 is selected as above. In more preferred
embodiments, M has 1-10 of the following groups, which can be
combined in any order: arylene, C(R 1 ) 2 , -C≡C-, N(R 1 ) and C(E); where E
is S, O or NR 1 ; where R 1 is selected as above. R 1 is preferably hydrogen;
and E is preferably oxygen. In certain embodiments herein, the group
attached to CHO or C(O)Me is an arylene group, preferably a monocyclic
arylene group, more preferably phenylene. Thus, in these more preferred
embodiments, M includes phenylene attached to CHO or C(O)Me, and 1-9
of the following groups, which can be combined in any order: CH 2 ,
-C≡C-, NH and C(O). In other embodiments herein, M preferably
includes 1-10, more preferably 1-6, of the following groups, which can
be combined in any order: CH 2 , -C≡C-, NH and C(O).

In more preferred embodiments, M has the formulae:

[Structural formulae not reproduced.]

3. Oxyamino Derivatives

In one preferred embodiment, the reagents for use in the methods 
provided herein contain a triphosphate group, as defined herein, a sugar 
group, a nucleobase, and an oxyamino group. The oxyamino group is 
preferably protected as described above. Thus, in this embodiment, the
compounds of formula (I) have formula (VII):

P 1 -S 1 -B 1 -M-O-NH-Y 2 ,

or a derivative thereof, where P 1 is a triphosphate group, preferably
located at the 5' position of S 1 ; S 1 and B 1 are selected as above; and

M is a divalent group having any combination, preferably 1-2000,
more preferably 1-1000, particularly 1-100, most preferably 1-50 or 1-25
or 1-10, of the following groups, which can be combined in any order:
arylene, heteroarylene, cycloalkylene, C(R 1 ) 2 , -C(R 1 )=C(R 1 )-,
>C=C(R 2 )(R 3 ), >C(R 2 )(R 3 ), -C≡C-, O, S(A) a , P(D) b (R 1 ), P(D) b (ER 1 ), N(R 1 ),
>N + (R 2 )(R 3 ) and C(E); where a is 0, 1 or 2; b is 0, 1, 2 or 3; A is O or
NR 1 ; D is S or O; and E is S, O or NR 1 ; each R 1 is a monovalent group
independently selected from hydrogen and M 1 -R 4 ; each M 1 is a divalent
group independently having any combination of the following groups,
which groups can be combined in any order: a direct link, arylene,
heteroarylene, cycloalkylene, C(R 5 ) 2 , -C(R 5 )=C(R 5 )-, >C=C(R 2 )(R 3 ),
>C(R 2 )(R 3 ), -C≡C-, O, S(A) a , P(D) b (R 5 ), P(D) b (ER 5 ), N(R 5 ), N(COR 5 ),
>N + (R 2 )(R 3 ) and C(E); where a is 0, 1 or 2; b is 0, 1, 2 or 3; A is O or
NR 5 ; D is S or O; and E is S, O or NR 5 ; R 4 and R 5 are each independently
selected from among hydrogen, halo, pseudohalo, cyano, azido, nitro,
SiR 6 R 7 R 8 , alkyl, alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl,
aralkenyl, aralkynyl, heteroaryl, heteroaralkyl, heteroaralkenyl,
heteroaralkynyl, heterocyclyl, heterocyclylalkyl, heterocyclylalkenyl,
heterocyclylalkynyl, hydroxy, alkoxy, aryloxy, aralkoxy, heteroaralkoxy
and NR 9 R 10 ; R 9 and R 10 are each independently selected from hydrogen,
alkyl, alkenyl, alkynyl, cycloalkyl, aryl, aralkyl, heteroaryl, heteroaralkyl
and heterocyclyl; R 2 and R 3 are selected from (i) or (ii) as follows: (i) R 2
and R 3 are independently selected from among hydrogen, alkyl, alkenyl,
alkynyl, cycloalkyl, aryl and heteroaryl; or (ii) R 2 and R 3 together form
alkylene, alkenylene or cycloalkylene; R 6 , R 7 and R 8 are each
independently a monovalent group selected from hydrogen, alkyl, alkenyl,
alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl, aralkenyl, aralkynyl,
heteroaryl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl, heterocyclyl,
heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, hydroxy,
alkoxy, aryloxy, aralkoxy, heteroaralkoxy and NR 9 R 10 ;

Y 2 is a salt of the oxyamino group, including but not limited to,
mineral acid salts, such as but not limited to hydrochlorides and sulfates,
and salts of organic acids, such as but not limited to acetates, lactates,
malates, tartrates, citrates, ascorbates, succinates, butyrates, valerates
and fumarates, or any amino protecting group, preferably an amino
protecting group useful in the synthesis of oligonucleotides, more
preferably monomethoxytrityl (MMT), dimethoxytrityl (DMT),
9-fluorenylmethoxycarbonyl (FMOC), acetyl, trifluoroacetyl (TFA), benzoyl, or an
oxime that is cleaved under mild acidic conditions (e.g., 100 mM acetate,
pH 4.5-5.5) including, but not limited to, an oxime formed from a lower
aliphatic aldehyde or ketone, preferably from acetone, cyclohexanone,
propanal or 2-butanone;

where R 1 , R 2 , R 3 , R 4 , R 5 , R 6 , R 7 , R 8 , R 9 , R 10 and Y 2 are
unsubstituted or substituted with one or more substituents each 
independently selected from Z, as defined above. 

In a preferred embodiment, the compounds for use in the methods
provided herein are of formula (VII) where M has 1-50, preferably 1-25,
more preferably 1-10 of the following groups, which can be combined in
any order: arylene, heteroarylene, C(R 1 ) 2 , -C(R 1 )=C(R 1 )-, -C≡C-, O,
S(A) a , N(R 1 ), N(COR 1 ) and C(E); where a is 0, 1 or 2; A is O or NR 1 ; and E
is S, O or NR 1 ; where R 1 is selected as above. In more preferred
embodiments, M has 1-10 of the following groups, which can be
combined in any order: C(R 1 ) 2 , -C≡C-, N(R 1 ) and C(E); where E is S, O or
NR 1 ; where R 1 is selected as above. R 1 is preferably hydrogen; and E is
preferably oxygen. In these more preferred embodiments, M includes
1-10, preferably 1-8, of the following groups, which can be combined in
any order: CH 2 , -C≡C-, NH and C(O).
In more preferred embodiments, M has the formula:

[Structural formula not reproduced.]

4. Nucleobases B 1

The nucleobases B 1 of the monomers provided herein are well
known to those of skill in the art, and include, but are not limited to,
cytosines, uracils, adenines, guanines and thymines, and analogs thereof,
including, but not limited to, deaza analogs, such as 7-deazaadenine,
7-deazaguanine and 7-deazahypoxanthine.

Thus, in preferred embodiments, the compounds of formula (I)
have the formulae:

[Structural formulae not reproduced.]

where P 1 , S 1 , M and X are selected as above.

5. Sugars S 1

The sugars S 1 of the monomers provided herein include, but are 
not limited to, riboses, deoxyriboses including 2'-deoxyriboses, and 
dideoxyriboses including 2',3'-dideoxyriboses. Other sugars for use 
herein include arabinoses, including β-D-arabinofuranose, and the riboses
3'-β-D-ribofuranose, 3'-amino-2',3'-dideoxy-β-D-ribofuranose,
2',3'-dideoxy-3'-fluoro-β-D-ribofuranose and
2',3'-dideoxy-2',3'-didehydro-β-D-ribofuranose.

In preferred embodiments, the compounds of formula (I) have the 
formulae:

[Structural formulae not reproduced.]

where P 1 , B 1 , M and X are selected as above.

6. Preferred Monomers

The presently most preferred compounds of formula (IV) have the 
structure:

[Structural formula not reproduced.]

where each R is independently H or OH.



The presently most preferred compounds of formulae (VI) have the 
structures:

[Structural formulae not reproduced.]

where each R is independently H or OH.

The presently most preferred compounds of formula (VII) have the 
structures:

[Structural formulae not reproduced.]

where each R is independently selected from H and OH.
7. Methods of Preparation of the Monomers

Methods of preparation of the monomers are provided. The
methods involve the steps of: (i) derivatizing a carboxylic acid selected
from an ω-carbonyl, an ω-protected hydrazino, and an ω-protected
oxyamino substituted carboxylic acid as the corresponding active ester,
including, but not limited to, aryl, heteroaryl, succinimidyl, maleimidyl,
phthalimidyl and naphthimidyl, preferably succinimidyl, esters by reacting
the carboxylic acid with the appropriate aryl, heteroaryl, succinimidyl,
maleimidyl, phthalimidyl or naphthimidyl alcohol, preferably
N-hydroxysuccinimide, under dehydrating conditions, including, but not
limited to, in the presence of a carbodiimide, such as
dicyclohexylcarbodiimide; and (ii) reacting the resulting active ester with
an amino substituted nucleoside triphosphate, including but not limited to
a 5-amino substituted cytidine triphosphate such as 5-1, as shown in
FIGURE 5. Preferred ω-carbonyl carboxylic acids include 4-formylbenzoic
acid, levulinic acid and 4-carboxyacetophenone. Preferred ω-protected
hydrazino carboxylic acids include 6-(N'-dimethoxytrityl)hydrazinonicotinic 
acid. Preferred ω-protected oxyamino carboxylic acids include
6-(N-dimethoxytrityl)oxyaminonicotinic acid.

B. Modified Oligonucleotides, Immobilized Oligonucleotides and
Oligonucleotide Conjugates

1. Modified Oligonucleotides

Provided herein are modified oligonucleotides that possess a 
hydrazino group (see, FIGURE 2), an oxyamino group, or a carbonyl
group. These modified oligonucleotides are synthesized directly during
enzymatic synthesis without the need for post-synthetic modification to
produce the hydrazino, oxyamino, or carbonyl group. The modified
oligonucleotide first components can be immobilized or conjugated
through the formation of covalent hydrazone or oxime bonds. Such
hydrazone or oxime bonds can be formed by reaction of the hydrazino,
oxyamino or carbonyl modified oligonucleotide first components with
appropriately derivatized solid surfaces, as described herein; or second
components, including, but not limited to, biopolymers as defined herein,
polymers including, but not limited to, polyamines, polyamides,
polyethers and polyethylene glycols, and other compounds of interest
herein for use in assays, kits, diagnostic arrays, and the like, including,
but not limited to, intercalators, vitamins, reporter molecules,
cholesterols, phospholipids, biotin, phenazine, phenanthridine,
anthraquinone, acridine, fluoresceins, rhodamines, coumarins, dyes,
antibodies, haptens, antigens, enzymes, and detection reagents including,
but not limited to, fluorophores, metals including, but not limited to, gold,
metal chelates, chromophores, fluorophore precursors and chromophore
precursors, that are modified to possess a hydrazino, oxyamino or
carbonyl group that is complementary to the carbonyl, oxyamino or
hydrazino group of the oligonucleotide of formula (II) for formation of
hydrazone or oxime linkage. Dendrimeric compounds that possess a
plurality of detectable groups, including, but not limited to, reporter
molecules, fluorophores, chromophores, fluorophore precursors and
chromophore precursors, are also contemplated herein as second
components. Fluorophore precursors and chromophore precursors are
compounds that react with the hydrazino, oxyamino or carbonyl group of
the modified oligonucleotide to form a fluorogenic or chromogenic group
for analysis. Fluorophore precursors and chromophore precursors are well
known to those of skill in the art, and include, but are not limited to,
3-(4-carboxybenzoyl)quinoline-2-carboxaldehyde (CBQCA) and
3-(2-furoyl)quinoline-2-carboxaldehyde (FQ), sold by Molecular Probes, Inc.
(Eugene, Oregon).

Importantly, the reaction to form the immobilized oligonucleotide or
oligonucleotide conjugate can be performed under aqueous conditions
without the need for additional reagents, such as a reducing agent.

The modified oligonucleotides provided herein have formula (II):

[Structure of formula (II) not reproduced]

or a derivative thereof, where O¹ and O² are each independently an
oligonucleotide or analog thereof, such as a protein nucleic acid (PNA); P²
is a phosphorus linking group resulting from the coupling of a compound
of formula (I) with the oligonucleotide or analog thereof, preferably a
phosphodiester group; and S¹, B¹, M and X are selected as above for the
hydrazino, oxyamino, and carbonyl monomers.

In certain embodiments, the oligonucleotide analogs of formula (II)
are selected with the proviso that if X is CHO, then M is not
undecylenecarbonylaminomethylene (n-C₁₁H₂₂C(O)NHCH₂) or
C^alkylene. In other embodiments, the oligonucleotide analogs are of
formula (II) where X is not an oxyamino or protected oxyamino group. In
further embodiments, the oligonucleotide analogs are of formula (II)
where X is not a hydrazide (-C(O)-NHNH₂) group. In particular
embodiments, the oligonucleotide analogs of formula (II) are selected with
the proviso that if X is -NHNH₂, then M is not
-CH(CH₂OH)(CH₂)₆NHCO-5-pyrid-2-ylene. In other embodiments, the
oligonucleotide analogs of formula (II) are selected with the proviso that
P² is not a phosphoramidate group.

Thus, in more preferred embodiments, the modified
oligonucleotides have formulae:

[Structures not reproduced; surviving labels: NHNH₂ and CHO]

where R⁴⁰ is selected from an oligonucleotide, OH and H; and R⁴¹ is
selected from OH and H.

Methods of synthesis of the modified oligonucleotides are also
provided. The methods involve the step of contacting a composition
having at least one nucleoside triphosphate provided herein with an
enzyme for oligonucleotide synthesis under conditions where
oligonucleotide synthesis occurs. Preferred enzymes include reverse
transcriptases, including, but not limited to, AMV reverse transcriptase,
MMLV reverse transcriptase and SuperScript reverse transcriptase, and
polymerases, including, but not limited to, Taq polymerase, DNA
polymerase, Klenow fragment and T4 DNA polymerase.

The modified oligonucleotides provided herein are also useful in
studying how a normal cell differs from its diseased state. Current gene
expression studies require the determination of the expression of any new
genes as well as the difference in expression of genes in the two cell
states. To this end, RNA from the two cell populations is isolated and
cDNA libraries are prepared using reverse transcriptase enzymes and
fluorescently labeled triphosphates such as Cy3 and Cy5 triphosphates.
It is recognized that the stability and kinetics of incorporation of the
labeled triphosphates are not identical. The modified triphosphates
provided herein may be used in both applications. This is advantageous
over current methodologies, e.g., amino substituted nucleoside
triphosphates (Clontech) labeled with succinimidyl fluorophores, due to
the inherent instability of the succinimidyl esters and the low
concentration of amino groups incorporated in the cDNA product.
2. Immobilized Oligonucleotides 

Oligonucleotides immobilized through a hydrazone or oxime linkage
to a solid surface are provided. The oligonucleotides are of formula (II),
and are modified with a hydrazino, oxyamino, or carbonyl moiety.
Reaction of the modified oligonucleotide with an appropriately
functionalized solid surface provides an immobilized oligonucleotide for
use in preparation of microarrays, diagnostic probe-assays, DNA
amplification by solid phase polymerase chain reactions (PCR), molecular
computing (see, e.g., Adleman (1994) Science 266:1021-1024; Kari
(1997) Mathematical Intelligencer 19:9-22; Frutos et al. (1997) Nucleic
Acids Res. 25:4748; Smith et al. (1998) J. Comp. Biol. 5:255; Liu et al.
(1998) J. Comp. Biol. 5:267; Frutos et al. (1998) J. Am. Chem. Soc.
120:10277; Wang et al. (1999) Biosystems 52:189-191; Liu et al.
(1999) Biosystems 52:25-33; Liu et al. (2000) Nature 403:175-179;
European Patent Application Publication No. EP 0 772 135; Reed et al.
(June 2000) Scientific American:86-93), molecular addressing (Niemeyer
et al. (1994) Nucl. Acids Res. 22(25):5530-5539), DNA sequencing by
mass spectrometry (see, e.g., U.S. Patent Nos. 6,074,823 and
5,547,835) and in studying the molecular electronics of DNA (see, e.g.,
U.S. Patent Nos. 6,071,699, 6,066,448, 5,952,172 and 5,824,473).

Importantly, the reaction to form the immobilized oligonucleotide
can be performed under aqueous conditions without the need for
additional reagents, such as a reducing reagent.

The modified solid surfaces are prepared on a variety of supports,
as described herein. Appropriately modified solid surfaces can be
prepared by the methods provided herein and possess carbonyl,
oxyamino, or hydrazino groups complementary to the modified
oligonucleotide. Reaction of the modified oligonucleotide with the
modified solid surface results in covalent attachment of the
oligonucleotide to the solid surface through a hydrazone or oxime linkage.
In embodiments where X of the modified oligonucleotide is a
hydrazino or oxyamino group, the solid surface can be modified to
possess an epoxide, α-bromocarbonyl, maleimide, maleic anhydride,
isothiocyanate or isocyanate group. Such solid surfaces can be prepared
by methods provided herein or other methods well known to those of skill
in the art. For example, reaction of pentafluorophenyl
4-isothiocyanatobenzoate with an amino or hydroxy solid surface results
in formation of an isothiocyanato modified solid surface. Some of these
surfaces are commercially available from, e.g., Pierce (Rockford,
Illinois), SINTEF Applied Chemistry (Trondheim, Norway), Rapp Polymere
GmbH (Tübingen, Germany), and Dyno Particles AS (Trondheim, Norway).
Reaction of the hydrazino or oxyamino group of the modified
oligonucleotide with the epoxide, α-bromocarbonyl, maleimide, maleic
anhydride, isothiocyanate or isocyanate group of the solid surface results
in covalent attachment of the oligonucleotide to the solid surface.
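By way of illustration only, the pairings of oligonucleotide group X and reactive surface group described above can be written as a simple lookup. The Python sketch below is not part of the methods provided herein: the function name `surface_groups_for` and the string labels are hypothetical, and listing aldehyde and ketone as the carbonyl surfaces is an assumption based on the carbonyl modified surfaces described elsewhere herein.

```python
# Illustrative lookup only; labels mirror the group names used in the text.
ELECTROPHILIC_SURFACE_GROUPS = (
    "epoxide", "alpha-bromocarbonyl", "maleimide",
    "maleic anhydride", "isothiocyanate", "isocyanate",
)
CARBONYL_SURFACE_GROUPS = ("aldehyde", "ketone")  # assumption: carbonyl surfaces


def surface_groups_for(x):
    """List surface groups described as reactive toward oligonucleotide group X."""
    if x == "carbonyl":
        # A carbonyl modified oligonucleotide pairs with a hydrazino or oxyamino surface.
        return ["hydrazino", "oxyamino"]
    if x in ("hydrazino", "oxyamino"):
        # Hydrazone/oxime formation with carbonyls, or attachment to the listed electrophiles.
        return list(CARBONYL_SURFACE_GROUPS + ELECTROPHILIC_SURFACE_GROUPS)
    raise ValueError(f"unknown modification: {x!r}")


for x in ("hydrazino", "oxyamino", "carbonyl"):
    print(x, "->", ", ".join(surface_groups_for(x)))
```

The lookup records only which pairings are named in the text; it says nothing about relative reactivity or preferred conditions.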

In certain embodiments, particularly where X is an oxyamino group,
the immobilized oligonucleotides are selected such that the solid surface
is not modified with an aldehyde or epoxide group.

3. Oligonucleotide Conjugates

Oligonucleotide first components conjugated to second 
components, including, but not limited to, biopolymers as defined herein, 
polymers including, but not limited to, polyamines, polyamides, 
polyethers and polyethylene glycols, and other compounds of interest 
herein for use in assays, kits, diagnostic arrays, and the like, including, 
but not limited to, intercalators, vitamins, reporter molecules, 
cholesterols, phospholipids, biotin, phenazine, phenanthridine, 
anthraquinone, acridine, fluoresceins, rhodamines, coumarins, dyes, 
antibodies, haptens, antigens, enzymes, and detection reagents including, 
but not limited to, fluorophores, metals including, but not limited to, gold, 
metal chelates, chromophores, fluorophore precursors and chromophore 
precursors, that are modified to possess a hydrazino, oxyamino or
carbonyl group that is complementary to the carbonyl, oxyamino or 
hydrazino group of the modified oligonucleotide of formula (II) for 
formation of hydrazone or oxime linkages are provided. Dendrimeric 
compounds that possess a plurality of detectable groups, including, but 
not limited to, reporter molecules, fluorophores, chromophores, 
fluorophore precursors and chromophore precursors, are also 
contemplated herein as second components. Fluorophore precursors and 
chromophore precursors are compounds that react with the hydrazino, 
oxyamino or carbonyl group of the modified oligonucleotide to form a 
fluorogenic or chromogenic group for analysis. 

Reaction of a modified oligonucleotide of formula (II) with an
appropriately modified second component provides a hydrazone or oxime
linked conjugate. Thus, in one embodiment, a hydrazino modified
oligonucleotide of formula (II) reacts with a carbonyl containing second
component, or a second component modified to possess a carbonyl
group, such as an aldehyde or a ketone, to afford the conjugate provided
herein. In another embodiment, a carbonyl modified oligonucleotide of
formula (II) is reacted with a second component containing a hydrazino
group, or modified to contain a hydrazino group, to afford the conjugate
provided herein. In other embodiments, the modified oligonucleotide
possesses an oxyamino or carbonyl group, and the second component
possesses a complementary carbonyl or oxyamino group, or is modified
to possess a complementary carbonyl or oxyamino group, to afford an
oxime linked conjugate.
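The complementarity rule of the preceding paragraph (hydrazino with carbonyl gives a hydrazone; oxyamino with carbonyl gives an oxime) can be sketched in a few lines of Python. This is an illustrative aid only; the function name `linkage` and the string labels are hypothetical and do not appear in the disclosure.

```python
def linkage(first_group, second_group):
    """Return the linkage formed by two complementary groups, or None if not complementary."""
    pair = frozenset((first_group, second_group))
    if pair == frozenset(("hydrazino", "carbonyl")):
        return "hydrazone"
    if pair == frozenset(("oxyamino", "carbonyl")):
        return "oxime"
    # Any other combination is not a complementary pair in the sense used above.
    return None


print(linkage("hydrazino", "carbonyl"))
print(linkage("carbonyl", "oxyamino"))
print(linkage("hydrazino", "oxyamino"))
```

Because the pair is compared as an unordered set, it does not matter whether the oligonucleotide or the second component carries the carbonyl group, which matches the symmetric way the embodiments are stated.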

Importantly, the reaction to form the oligonucleotide conjugate can 
be performed under aqueous conditions without the need for additional 
reagents, such as a reducing agent. 

C. Hydrazino, Oxyamino and Carbonyl Modified Surfaces

Provided herein are reagents and methods for modification of solid 
surfaces and second components for use in the methods provided herein. 
In particular, the modified solid surfaces and second components are
useful for the immobilization of the oligonucleotide first components
provided herein, and in the conjugation of such first components,
respectively.

It is to be understood that the reagents described below are 
interchangeable in certain applications. For example, a dihydrazine 
reagent for modification of oligosaccharides can also be used in the 
modification of solid surfaces to provide a hydrazino solid surface. Many
of the other reagents provided below can be used in the preparation of 
either modified solid surfaces or in the modification of second 
components for conjugation with the modified oligonucleotides provided 
herein. 

1. Modified Surfaces

Hydrazino, oxyamino and carbonyl modified surfaces are provided.
The surfaces provided herein include, but are not limited to, glasses
including controlled pore glass, plastics including
polyethylenes, polycarbonates, polypropylenes, polyamides such as
Nylon® and polyacrylamides, and polyvinylidenedifluorides, latexes,
rubbers, celluloses including nitrocelluloses and diazotized celluloses,
metals including platinum, nickel, zinc, tin, palladium, steel, gold,
silver, aluminum, silicon and copper, silicas, agaroses including
Sepharose®, dextrans including Sephadex®, natural sponges, polystyrenes
including polystyrene crosslinked with divinylbenzene, polystyrenes
radiation grafted onto perfluorinated polymers such as Teflon® (see, e.g.,
Maeji et al. (1994) Reactive Polymers 22:203-212; and Berg et al. (1989)
J. Am. Chem. Soc. 111:8024-8026), oxides of metals and metalloids
such as Pt-PtO, Si-SiO, Au-AuO, TiO₂, Cu-CuO, and the like, and
compound semiconductors, such as lithium niobate, gallium arsenide and
indium-phosphide. Other solid surfaces for use herein are well known to
those of skill in the art. See, e.g., U.S. Patent Nos. 4,507,230,
4,006,117, 5,389,449, 5,556,752, 4,683,202 and 5,744,305;
International Patent Application Publication Nos. WO 00/04382, WO
00/04390 and WO 00/04389; Merrifield (1964) Biochemistry
3:1385-1390; Berg et al. (1990) in Innovation Perspect. Solid Phase
Synth. Collect. Pap., Int. Symp., 1st, Epton, Roger (Ed), pp. 453-459;
Berg et al. (1989) in Pept., Proc. Eur. Pept. Symp., 20th, Jung, G. et al.
(Eds), pp. 196-198; Berg et al. (1989) J. Am. Chem. Soc.
111:8024-8026; Kent et al. (1979) Isr. J. Chem. 17:243-247; Kent et al.
(1978) J. Org. Chem. 43:2845-2852; Mitchell et al. (1976) Tetrahedron
Lett. 42:3795-3798; and Hermanson et al. (1992) Immobilized Affinity
Ligand Techniques, Academic Press, Inc., San Diego.

Synthetic solid surfaces include those made from polymers and
co-polymers such as polyvinylalcohols, acrylates and acrylic acids such as
polyethylene-co-acrylic acid, polyethylene-co-methacrylic acid,
polyethylene-co-ethylacrylate, polyethylene-co-methyl acrylate,
polypropylene-co-acrylic acid, polypropylene-co-methyl-acrylic acid,
polypropylene-co-ethylacrylate, polypropylene-co-methyl acrylate,
polyethylene-co-vinyl acetate, polypropylene-co-vinyl acetate, and those
containing acid anhydride groups such as polyethylene-co-maleic
anhydride, polypropylene-co-maleic anhydride and the like. Liposomes
have also been used as solid surfaces for affinity purifications (see,
e.g., Powell et al. (1989) Biotechnol. Bioeng. 33:173).

For example, U.S. Patent No. 5,403,750 describes the preparation
of polyurethane-based polymers. U.S. Pat. No. 4,241,537 describes a
plant growth medium containing a hydrophilic polyurethane gel
composition prepared from chain-extended polyols; random copolymerization
is preferred with up to 50% propylene oxide units so that the prepolymer
will be a liquid at room temperature. U.S. Pat. No. 3,939,123 describes
lightly crosslinked polyurethane polymers of isocyanate terminated
prepolymers comprised of poly(ethyleneoxy) glycols with up to 35% of a
poly(propyleneoxy) glycol or a poly(butyleneoxy) glycol. In producing
these polymers, an organic polyamine is used as a crosslinking agent.
Other solid surfaces and preparation thereof are described in U.S. Patent
Nos. 4,177,038, 4,175,183, 4,439,585, 4,485,227, 4,569,981,
5,092,992, 5,334,640, and 5,328,603.

U.S. Patent No. 4,162,355 describes a polymer suitable for use in
affinity chromatography, which is a polymer of an aminimide and a vinyl
compound having at least one pendant halo-methyl group. An amine
ligand, which affords sites for binding in affinity chromatography, is
coupled to the polymer by reaction with a portion of the pendant
halo-methyl groups and the remainder of the pendant halo-methyl groups
are reacted with an amine containing a pendant hydrophilic group. A
method of coating a substrate with this polymer is also described. An
exemplary aminimide is 1,1-dimethyl-1-(2-hydroxyoctyl)amine
methacrylimide and an exemplary vinyl compound is a chloromethyl styrene.

U.S. Patent No. 4,171,412 describes specific matrices based on
hydrophilic polymeric gels, preferably of a macroporous character, which
carry covalently bonded D-aminoacids or peptides which contain
D-aminoacid units. The basic carrier is prepared by copolymerization of
hydroxyalkyl esters or hydroxyalkylamides of acrylic and methacrylic acid
with crosslinking acrylate or methacrylate comonomers and is modified by
reaction with diamines, aminoacids or dicarboxylic acids, and the
resulting carboxyterminal or aminoterminal groups are condensed with
D-analogs of aminoacids or peptides. The peptide containing
D-aminoacids also can be synthesized stepwise on the surface of the
carrier.

U.S. Patent No. 4,178,439 describes a cationic ion exchanger and 
a method for preparation thereof. U.S. Patent No. 4,180,524 describes 
chemical syntheses on a silica support. 

The solid supports can be identifiable, such as by electronic
tags or bar codes (see, e.g., U.S. Patent No. 6,025,129; U.S. Patent No.
6,017,496; U.S. Patent No. 5,972,639; U.S. Patent No. 5,961,923; U.S.
Patent No. 5,925,562; U.S. Patent No. 5,874,214; U.S. Patent No.
5,751,629; U.S. Patent No. 5,741,462), or chemical tags (see, U.S.
Patent No. 5,432,018; U.S. Patent No. 5,547,839), addressable arrays
and other such addressing methods. Supports with electronic tags
include supports with radio-frequency tags, such as IRORI MICROKANS®
and MICROTUBES® microreactors (see, U.S. Patent No. 6,025,129; U.S.
Patent No. 6,017,496; U.S. Patent No. 5,972,639; U.S. Patent No.
5,961,923; U.S. Patent No. 5,925,562; U.S. Patent No. 5,874,214; U.S.
Patent No. 5,751,629; U.S. Patent No. 5,741,462; International PCT
application No. WO 98/31732; International PCT application No.
WO 98/15825; and, see, also U.S. Patent No. 6,087,186).

The solid surfaces can take the form of beads, capillaries, flat
supports such as glass fiber filters, multiwell plates, thin films,
membranes, wafers, combs, pins, beads in pits of flat surfaces such as
wafers (e.g., silicon wafers), with or without filter plates, pellets, disks,
continuous surfaces such as a microtiter dish or well, hollow fibers,
needles, solid fibers, slides, chips, sheets, containers or meshes. The
solid surfaces can be formed in arrays for combinatorial synthesis or
analysis, or for formation of a combinatorial library. Other supports for
use herein include paper plus coated with a suitable matrix

The solid surface is modified to possess a hydrazino, oxyamino, or 
carbonyl group that is complementary to the hydrazino, oxyamino or 
carbonyl moiety of the oligonucleotide of formula (II) for formation of a 
hydrazone or oxime linkage. 

The hydrazino or oxyamino modified surfaces are prepared from
commercially available solid supports or known supports such as those
described above. Reaction of the support with, e.g.,
5-succinimidyloxycarbonylpyrid-2-yl acetone hydrazone having the formula:

[Structure not reproduced; surviving labels: O, SuO, NH-N, Me, Me]

or the corresponding oxyamino compound having the formula:

[Structure not reproduced; surviving labels: O, SuO, O-N, Me, Me]

forms a hydrazone or oxime modified solid surface. Alternatively, the
hydrazone or oxime succinimidyl esters shown above are reacted with an
amino-substituted silane, such as 3-aminopropyltriethoxysilane, to form a
silyl hydrazone or oxime. Reaction of the silyl group with a solid surface
forms a hydrazone or oxime modified solid surface. Hydrolysis of the
hydrazone or oxime group under mild acidic conditions (e.g., 0.1 M
acetate, pH 4.7) affords the hydrazino or oxyamino modified surfaces
provided herein. See, co-pending, co-owned U.S. provisional application
Serial No. 60/191,186, the disclosure of which is incorporated herein by
reference in its entirety.
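For orientation, the composition of the exemplary hydrolysis buffer (0.1 M acetate, pH 4.7) can be estimated with the Henderson-Hasselbalch relation. The sketch below is an illustration, not a procedure taken from the disclosure: it assumes that "0.1 M acetate" means 0.1 M total of acetic acid plus acetate, and it uses the textbook pKa of acetic acid (about 4.76 near 25 °C). The function name is hypothetical.

```python
PKA_ACETIC_ACID = 4.76  # assumption: textbook value near 25 degrees C


def acetate_buffer_composition(ph, total_molar, pka=PKA_ACETIC_ACID):
    """Split a total acetate concentration into (acetic acid, acetate) at a given pH."""
    # Henderson-Hasselbalch: pH = pKa + log10([acetate] / [acetic acid])
    ratio = 10 ** (ph - pka)
    acid = total_molar / (1 + ratio)
    base = total_molar - acid
    return acid, base


acid, base = acetate_buffer_composition(ph=4.7, total_molar=0.1)
print(f"acetic acid: {acid:.4f} M, acetate: {base:.4f} M")
```

At pH 4.7, slightly below the pKa, the acid form is in modest excess, consistent with the description of the conditions as mildly acidic.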
In one embodiment, hydrazino modified gold surfaces are prepared
by reaction with a dihydrazino-dithiol compound of formula X-M-S-S-M-X,
where each S is sulfur, each X is independently a protected or
unprotected hydrazino group, and each M is independently selected as
above. In preferred embodiments, the compound is symmetrical and has
the formula (X-M-S-)₂. In these embodiments, M preferably has any
combination, preferably 1 to 100, more preferably 1 to 50, most
preferably 1 to 10, of the following groups: arylene, heteroarylene, C(E),
N(R¹) and C(R¹)₂, where E and R¹ are selected as above, which groups
can be combined in any order. R¹ is preferably hydrogen and E is
preferably oxygen. In more preferred embodiments, M has 1 to 10 of the
following groups: pyridylene, NH, C(O) and CH₂, which groups can be
combined in any order. M is most preferably derived from cysteamine.
Briefly, reaction of cysteamine dimer (i.e., (H₂NCH₂CH₂S)₂) with
succinimidyl 2-BOC-hydrazino-5-pyridylcarboxylate, followed by
deprotection with, e.g., dilute hydrochloric acid, provides the
dihydrazino-dithiol:

[Structure not reproduced]

Reaction of this dithiol with a gold surface results in cleavage of the
dithiol bond to form a hydrazino modified gold surface (i.e.,
Au-S-CH₂CH₂-NHC(O)-pyr-NHNH₂).
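The "more preferred" definition of M above (1 to 10 groups drawn from pyridylene, NH, C(O) and CH₂, in any order) lends itself to a mechanical check. The following Python sketch is illustrative only; it reads "1 to 10 of the following groups" as a limit on the total number of groups in M, which is an interpretation, and the function and variable names are hypothetical.

```python
# Groups named in the "more preferred" definition of M above.
MORE_PREFERRED_GROUPS = frozenset({"pyridylene", "NH", "C(O)", "CH2"})


def is_more_preferred_linker(groups, max_groups=10):
    """Return True if M consists of 1 to max_groups allowed groups, in any order."""
    groups = list(groups)
    return 1 <= len(groups) <= max_groups and all(
        g in MORE_PREFERRED_GROUPS for g in groups
    )


# M of the gold-bound product Au-S-CH2CH2-NHC(O)-pyr-NHNH2, read from S to NHNH2.
cysteamine_derived_m = ["CH2", "CH2", "NH", "C(O)", "pyridylene"]
print(is_more_preferred_linker(cysteamine_derived_m))
```

The cysteamine-derived linker contains five groups, all from the allowed set, so it falls within this definition under the stated reading.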

Carbonyl modified surfaces are prepared from commercially
available solid supports such as those described above. Reaction of the
support with, e.g., succinimidyl 4-formylbenzoate or
4'-succinimidyloxycarbonylacetophenone forms a carbonyl, particularly an
aldehyde or methyl ketone, modified solid surface. Alternatively,
succinimidyl 4-formylbenzoate or 4'-succinimidyloxycarbonylacetophenone
is reacted with an amino-substituted silane, such as
3-aminopropyltriethoxysilane, to form a silyl aldehyde or ketone. Reaction
of the silyl group with a solid surface forms an aldehyde or ketone
modified solid surface.

2. Modified Second Components

Hydrazino, oxyamino, and carbonyl modified second components
can be prepared by methods provided herein, or by methods well known
to those of skill in the art.

a. Modified Oligosaccharides and Proteins

For example, an oligosaccharide can be reacted with cyanogen
bromide or 1-cyano-4-dimethylaminopyridinium bromide (CDAP) to afford a
cyano ether of formula R-O-CN. Reaction of this cyano ether with a
hydrazino, oxyamino or carbonyl substituted amine, provided herein,
affords a hydrazino, oxyamino or carbonyl substituted oligosaccharide,
where the hydrazino, oxyamino or carbonyl group is linked to the
oligosaccharide through a group of formula -O-C(=NH)-NH-.

In one embodiment, the hydrazino, oxyamino or carbonyl
substituted amine provided herein for oligosaccharide modification has
the formula:

H₂N-M-X,

or a derivative thereof, where M and X are as defined above. In certain
embodiments, the compound of formula H₂N-M-X is selected where M is
not C₁₋₁₂ alkyl.

In the above embodiments, the modified oligosaccharide has the
formula:

O²-O-C(=NH)-NH-M-X,

where O² is an oligosaccharide; and M and X are as defined above.

Alternatively, oligosaccharides are derivatized as their hydrazino
analogs by reaction with CDAP and a dihydrazino compound. The
dihydrazino compounds have the formula:

X-M-X,

or a derivative thereof, where each X is independently a hydrazino group
and M is selected as above. In preferred embodiments, M has any
combination, preferably 1 to 100, more preferably 1 to 50, most
preferably 1 to 10, of the following groups: arylene, heteroarylene,
C(R¹)₂, N(R¹) and C(E), where R¹ and E are selected as above, which
groups can be combined in any order. In preferred embodiments, R¹ is
hydrogen and E is oxygen. In more preferred embodiments, M has 1 to
10 of the following groups: heteroarylene, CH₂, NH and C(O), which
groups can be combined in any order. M most preferably is derived, in
these embodiments, from reaction of 1,6-hexanediamine with two
equivalents of succinimidyl N'-BOC-6-hydrazinonicotinate. Thus, the
reagent of formula X-M-X most preferably has formula:

[Structure not reproduced]

The resulting hydrazino modified oligosaccharide is then conjugated
to a carbonyl modified oligonucleotide provided herein. Similarly,
di(oxyamino) compounds of formula X-M-X, or a derivative thereof, where
X is an oxyamino group and M is selected as above, can be used to
generate oxyamino modified oligosaccharides for conjugation with
carbonyl modified oligonucleotides.

Polysaccharides or glycoproteins can be oxidized with sodium
periodate to yield aldehyde groups directly. These aldehyde groups can
be directly labeled with hydrazine modified oligonucleotides as described
herein. Lysine moieties on proteins can be modified to incorporate
aldehydes using succinimidyl 4-formylbenzoate (Pierce Chemical Co.).
These aldehyde containing proteins can be directly conjugated to
hydrazine modified oligonucleotides.

Hydrazine groups can also be incorporated on insoluble
polysaccharides such as cellulose or agarose using
1-cyano-4-dimethylaminopyridinium bromide (CDAP) as described by Lees
et al. ((1996) Vaccine 14:190 and (2000) Vaccine 18:1273) using a
dihydrazine molecule (see, e.g., Example 16).

Modified proteins possessing hydrazino or oxyamino groups can be 
prepared by the methods described in U.S. Patent Nos. 5,206,370, 
5,420,285 and 5,753,520, and European Patent Specification EP 0 384 
769 B1, the disclosures of which are incorporated herein by reference in 
their entirety. These hydrazine modified proteins can be directly 
conjugated to aldehyde modified oligonucleotides prepared as described 
herein. Carbonyl modified proteins are prepared by reaction of an
aldehyde or ketone substituted succinimidyl ester with nucleophilic
groups of the protein, including, but not limited to, the amino side chain
of lysine. In certain embodiments, the aldehyde or ketone substituted
N-hydroxysuccinimide ester is, for example, succinimidyl 4-formyl- or
4-acetylbenzoate. In other embodiments, the protein is reacted with
succinimidyl levulinate (succinimidyl 4-oxopentanoate) to provide a
ketone substituted protein.

Methods of synthesis of hydrazino, oxyamino, and carbonyl 
modified oligonucleotides are provided herein. 

In other embodiments herein, particularly when the modified
oligonucleotide possesses a hydrazino or oxyamino group, the second
component can possess an epoxide, α-bromocarbonyl, maleimide, maleic
anhydride, isothiocyanate or isocyanate group. Such second components
can be prepared by methods provided herein or other methods well
known to those of skill in the art. For example, reaction of
pentafluorophenyl 4-isothiocyanatobenzoate with an amino or hydroxy
group of a second component results in formation of an isothiocyanato
modified second component. Reaction of the hydrazino or oxyamino group of
the modified oligonucleotide with the epoxide, α-bromocarbonyl,
maleimide, maleic anhydride, isothiocyanate or isocyanate group of the
second component results in covalent attachment of the oligonucleotide
to the second component. In these embodiments, the conjugation
reaction with a hydrazino modified oligonucleotide can be conducted at a
lower pH than for the corresponding amino modified oligonucleotide.

b. Modified Indicators
Modified indicators can be prepared according to the methods
provided herein. The modified indicators are conjugated with the
modified oligonucleotides of formula (II) to form a hydrazone or oxime
linkage. Preferred indicators include, but are not limited to, fluoresceins,
rhodamines and cyanine dyes. The modified indicators possess a
hydrazino, oxyamino or carbonyl group that is complementary to the
carbonyl, oxyamino or hydrazino group of the monomers of formula (I) or
the modified oligonucleotides of formula (II).

In one embodiment, the modified indicators have the formula:

F*-M-X,

or a derivative thereof, where F* is a fluorescein, a rhodamine or a
cyanine dye; X is a protected or unprotected hydrazino group, a protected
or unprotected oxyamino group or a carbonyl group; and

M is a divalent group having any combination of the following
groups, which can be combined in any order: arylene, heteroarylene,
cycloalkylene, C(R¹)₂, -C(R¹)=C(R¹)-, >C=C(R²)(R³), >C(R²)(R³), -C≡C-,
O, S(A)_a, P(D)_b(R¹), P(D)_b(ER¹), N(R¹), >N⁺(R²)(R³) and C(E); where a
is 0, 1 or 2; b is 0, 1, 2 or 3; A is O or NR¹; D is S or O; and E is S, O
or NR¹; each R¹ is a monovalent group independently selected from
hydrogen and M¹-R⁴; each M¹ is a divalent group independently having any
combination of the following groups, which groups can be combined in any
order: a direct link, arylene, heteroarylene, cycloalkylene, C(R⁵)₂,
-C(R⁵)=C(R⁵)-, >C=C(R²)(R³), >C(R²)(R³), -C≡C-, O, S(A)_a, P(D)_b(R⁵),
P(D)_b(ER⁵), N(R⁵), N(COR⁵), >N⁺(R²)(R³) and C(E); where a is 0, 1 or 2;
b is 0, 1, 2 or 3; A is O or NR⁵; D is S or O; and E is S, O or NR⁵; R⁴
and R⁵ are each independently selected from among hydrogen, halo,
pseudohalo, cyano, azido, nitro, SiR⁶R⁷R⁸, alkyl, alkenyl, alkynyl,
haloalkyl, haloalkoxy, aryl, aralkyl, aralkenyl, aralkynyl, heteroaryl,
heteroaralkyl, heteroaralkenyl, heteroaralkynyl, heterocyclyl,
heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, hydroxy,
alkoxy, aryloxy, aralkoxy, heteroaralkoxy and NR⁹R¹⁰; R⁹ and R¹⁰ are
each independently selected from hydrogen, alkyl, alkenyl, alkynyl,
cycloalkyl, aryl, aralkyl, heteroaryl, heteroaralkyl and heterocyclyl; R²
and R³ are selected from (i) or (ii) as follows: (i) R² and R³ are
independently selected from among hydrogen, alkyl, alkenyl, alkynyl,
cycloalkyl, aryl and heteroaryl; or (ii) R² and R³ together form
alkylene, alkenylene or cycloalkylene; R⁶, R⁷ and R⁸ are each
independently a monovalent group selected from hydrogen, alkyl, alkenyl,
alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl, aralkenyl, aralkynyl,
heteroaryl, heteroaralkyl, heteroaralkenyl, heteroaralkynyl, heterocyclyl,
heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, hydroxy,
alkoxy, aryloxy, aralkoxy, heteroaralkoxy and NR⁹R¹⁰; and

R¹, R², R³, R⁴, R⁵, R⁶, R⁷, R⁸, R⁹ and R¹⁰ are unsubstituted or
substituted with one or more substituents each independently selected
from Z, wherein Z is selected from alkyl, alkenyl, alkynyl, aryl, cycloalkyl,
cycloalkenyl, hydroxy, S(O)_hR²⁰, NR²⁰R²¹, COOR²⁰, COR²⁰, CONR²⁰R²¹,
OC(O)NR²⁰R²¹, N(R²⁰)C(O)R²¹, alkoxy, aryloxy, heteroaryl, heterocyclyl,
heteroaryloxy, heterocyclyloxy, aralkyl, aralkenyl, aralkynyl, heteroaralkyl,
heteroaralkenyl, heteroaralkynyl, aralkoxy, heteroaralkoxy,
alkoxycarbonyl, carbamoyl, thiocarbamoyl, alkoxycarbonyl, carboxyaryl,
halo, pseudohalo, haloalkyl and carboxamido; h is 0, 1 or 2; and R²⁰ and
R²¹ are each independently selected from among hydrogen, halo,
pseudohalo, cyano, azido, nitro, trialkylsilyl, dialkylarylsilyl, alkyldiarylsilyl,
triarylsilyl, alkyl, alkenyl, alkynyl, haloalkyl, haloalkoxy, aryl, aralkyl,
aralkenyl, aralkynyl, heteroaryl, heteroaralkyl, heteroaralkenyl,
heteroaralkynyl, heterocyclyl, heterocyclylalkyl, heterocyclylalkenyl,
heterocyclylalkynyl, hydroxy, alkoxy, aryloxy, aralkoxy, heteroaralkoxy,
amino, amido, alkylamino, dialkylamino, alkylarylamino, diarylamino and
arylamino.

In preferred embodiments herein, the modified indicators are those
where the fluorescein is 5-aminofluorescein, 6-aminofluorescein,
fluorescein 5-isothiocyanate, fluorescein 6-isothiocyanate or fluorescein
thiosemicarbazide; the rhodamine is rhodamine 123, rhodamine B,
rhodamine B isothiocyanate, rhodamine 6G, rhodamine 110 or ROX
having the structure:

[structure of ROX not reproduced]

where each R is independently H or COOH; and the cyanine dye is Cy3, 
benzo-Cy3, dibenzo-Cy3, Cy5, benzo-Cy5, dibenzo-Cy5, Cy7, benzo-Cy7 
or dibenzo-Cy7 having the structures: 

[structures of the cyanine dyes not reproduced]

n = 1 (Cy3), 2 (Cy5) or 3 (Cy7)

n = 1 (benzo-Cy3), 2 (benzo-Cy5) or 3 (benzo-Cy7)

n = 1 (dibenzo-Cy3), 2 (dibenzo-Cy5) or 3 (dibenzo-Cy7)

See, e.g., FIGURE 12 and International Patent Application Publication No.
WO 99/65993.

In more preferred embodiments, the modified indicator is chosen from

[structures not reproduced]

or a derivative thereof, where F* is selected as above.

More preferably, the modified indicators are those where F* is 
fluorescein, rhodamine, ROX, Cy3 or Cy5. Most preferably, the modified 
indicators have the formula: 

[structural formula not reproduced; the surviving fragment shows a terminal NHNH2 (hydrazino) group]

where F* is fluorescein, rhodamine, ROX, Cy3 or Cy5. 
D. Methods of Use of Modified Oligonucleotides 

The hydrazino, oxyamino, and carbonyl modified oligonucleotides 
provided herein have application in numerous diagnostic methods. One 
application is use of the modified oligonucleotides in a DNA probe 
detection system. See, e.g., U.S. Patent No. 5,876,938. In such a
system, a modified polynucleotide is prepared using random-primed DNA
labeling and a hydrazino, oxyamino or carbonyl containing monomer of
formula (I) provided herein. It is to be understood that any of the 
monomers described herein could also be used, as well as alternative 
methods of polymer formation (e.g., nick translation, solid phase 
synthesis, and terminal transferase). Following preparation of the 
hydrazino, oxyamino or carbonyl labeled probe, a blot hybridization can be 
carried out in which the labeled probe is applied to filter-bound DNA 
under conditions in which hybridization takes place between the probe 
and a target polynucleotide. The presence of a target polynucleotide can 
then be determined using, for example, enzyme-linked detection. Alkaline 
phosphatase that is linked to a carbonyl, oxyamino or hydrazino moiety 
complementary to the hydrazino, oxyamino or carbonyl group of the 
modified oligonucleotide, will react with the hydrazino, oxyamino or 
carbonyl portion of the probe. Subsequent treatment with a substrate for 
alkaline phosphatase which forms a detectable product provides a means 
by which the presence of the target nucleic acid or polynucleotide can be 
detected. 

Alternatively, the modified oligonucleotides provided herein can be
used for affinity purification of a target oligonucleotide. In one
embodiment, the probes are prepared as described above and added to a
mixture of oligonucleotides containing a target oligonucleotide. Following
hybridization, magnetic beads having an attached reactive moiety that is
complementary to the hydrazino, oxyamino or carbonyl group of the
probe are placed in the mixture and covalent bonding to the hydrazino,
oxyamino or carbonyl labeled probe occurs via a hydrazone or oxime
linkage. The beads (with attached probe and target) are drawn to a
magnetic plate and the remaining materials are washed away.

In other embodiments, the modified oligonucleotides can be used in
combination with other purification and labeling methods to provide
unique methods of isolating and detecting picogram quantities of target
oligonucleotides.

Accordingly, provided herein are methods for detecting the
presence of a target nucleic acid in a sample, comprising:

(a) contacting a sample with a modified oligonucleotide which is
substantially complementary to the target nucleic acid under conditions
sufficient to hybridize the modified oligonucleotide to the target nucleic
acid, thereby forming a hybridized complex;

(b) contacting the hybridized complex with a complexing agent
which comprises a detectable moiety or an indicator, and a carbonyl,
oxyamino or hydrazino moiety that is complementary to the hydrazino,
oxyamino or carbonyl group of the modified oligonucleotide; and

(c) detecting the presence of the detectable moiety or indicator,
thereby detecting the presence of the target nucleic acid.
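
The pairing of reactive groups relied on in steps (a) to (c), and in the affinity purification described above, can be summarized as a small lookup. The following is an illustrative sketch only; the names `LINKAGE`, `complementary_groups` and `linkage` are hypothetical, and the table encodes nothing beyond what is stated herein, namely that a hydrazino or oxyamino group pairs with a carbonyl group to give a hydrazone or oxime linkage, respectively.

```python
# Illustrative lookup of complementary reactive groups (all names hypothetical).
# Encodes only the pairing stated in the text: hydrazino/oxyamino <-> carbonyl.
LINKAGE = {
    frozenset({"hydrazino", "carbonyl"}): "hydrazone",
    frozenset({"oxyamino", "carbonyl"}): "oxime",
}


def complementary_groups(probe_group: str) -> list[str]:
    """Return the groups a complexing agent may carry to react with the probe."""
    partners = []
    for pair in LINKAGE:
        if probe_group in pair:
            partners.extend(pair - {probe_group})
    return sorted(partners)


def linkage(probe_group: str, agent_group: str) -> str:
    """Return the linkage formed, or raise if the groups are not complementary."""
    key = frozenset({probe_group, agent_group})
    if key not in LINKAGE:
        raise ValueError(f"{probe_group} and {agent_group} are not complementary")
    return LINKAGE[key]
```

For example, a carbonyl-modified oligonucleotide would call for a complexing agent carrying a hydrazino or oxyamino moiety, whereas two hydrazino groups are rejected as non-complementary.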

The hydrazino, oxyamino and carbonyl modified oligonucleotides of
formula (II) provided herein are useful in a variety of other methods,
including, but not limited to, diagnostic probe assays, DNA amplification
by solid phase polymerase chain reactions (PCR), molecular computing
(see, e.g., Adleman (1994) Science 266:1021-1024; Kari (1997)
Mathematical Intelligencer 19:9-22; Frutos et al. (1997) Nucleic Acids
Res. 25:4748; Smith et al. (1998) J. Comp. Biol. 5:255; Liu et al. (1998)
J. Comp. Biol. 5:267; Frutos et al. (1998) J. Am. Chem. Soc.
120:10277; Wang et al. (1999) Biosystems 52:189-191; Liu et al.
(1999) Biosystems 52:25-33; Liu et al. (2000) Nature 403:175-179;
European Patent Application Publication No. EP 0 772 135; Reed et al.
(June 2000) Scientific American:86-93), molecular addressing (Niemeyer
et al. (1994) Nucl. Acids Res. 22(25):5530-5539), DNA sequencing by
mass spectrometry (see, e.g., U.S. Patent Nos. 6,074,823 and
5,547,835) and in studying the molecular electronics of DNA (see, e.g.,
U.S. Patent Nos. 6,071,699, 6,066,448, 5,952,172 and 5,824,473). In
general, the modified oligonucleotide is immobilized as described herein
for use in the above methods.

The hydrazino modified oligonucleotides prepared using the
reagents of formula (I) are also useful for a variety of additional purposes,
including, but not limited to:

(i) direct attachment to surfaces such as carbonyl, i.e., aldehyde or
ketone, modified glasses including controlled pore glass, plastics including
polyethylene, polypropylene, polyamide and polyvinylidenedifluoride,
celluloses including nitrocelluloses, latexes, metals including steel, gold,
silver, aluminum, silicon and copper, silicas, agaroses including
Sepharose®, Sephadex® and polystyrenes, in the form of beads, capillaries,
flat supports such as glass fiber filters, multiwell plates or membranes,
wafers, combs, pins (e.g., arrays of pins suitable for combinatorial
synthesis or analysis) or beads in pits of flat surfaces such as wafers
(e.g., silicon wafers), with or without filter plates; resulting in
oligonucleotides attached via a hydrazone bond formed between the
hydrazino oligonucleotide and the carbonyl moiety on the surface; and

(ii) direct attachment to any biopolymer which has been modified
to include a carbonyl moiety using, for example, periodate oxidation of
carbohydrates or functionalization of amino moieties using carbonyl
modification reagents such as succinimidyl 4-formyl benzoate. See, e.g.,
Ghosh (European Patent Application Publication No. EP 0 361 768), the
disclosure of which is incorporated herein in its entirety.

In the cases wherein the final hydrazino product is required directly
from standard aqueous ammonia deprotection and cleavage from a solid
support, the hydrazino protecting group is a labile protecting group,
including, but not limited to, FMOC, benzoyl or acetyl, that is removed
during deprotection of the other oligonucleotide protecting groups. Thus
direct addition of the crude reaction product to a solid surface
appropriately modified with a moiety that reacts with the hydrazino
function will lead to covalent modification of the surface, and
non-hydrazino modified oligonucleotides and other impurities will be washed
away.

E. Preparation of the Compounds 
1. Monomers 

In general, the monomers of formula (I) provided herein can be
prepared by methods well known to those of skill in the art, or by
modification of those methods using techniques known to those of skill in
the art.

For example, monomers of formula (I) where X is a carbonyl group,
particularly an aldehyde or ketone group, can be prepared starting from
formyl- or keto-substituted carboxylic acids and carboxylic acid
derivatives. Many such formyl- or keto-substituted carboxylic acids and
carboxylic acid derivatives, including 4-formylbenzoic acid and
4'-carboxyacetophenone, are commercially available (see, e.g., Aldrich
Chemical Co., Milwaukee, Wisconsin). Reaction of the succinimidyl ester
or other activated ester of these carboxylic acids, prepared, e.g., from the
carboxylic acid, N-hydroxysuccinimide and dicyclohexylcarbodiimide,
with, e.g., an amino or hydroxy substituted nucleoside triphosphate
provides the desired carbonyl substituted nucleoside triphosphate (see,
e.g., FIGURE 5).

In another example, the monomers of formula (I) where X is a
hydrazino group or oxyamino group are prepared starting from hydrazino-
or oxyamino-substituted carboxylic acids and carboxylic acid derivatives.
Such hydrazino or oxyamino compounds are prepared by reaction of the
corresponding halocarboxylic acid with, e.g., hydrazine, hydroxylamine,
or derivatives thereof. Semicarbazides and thiosemicarbazides are
prepared by reaction of hydrazine with the corresponding isocyanates and
isothiocyanates. Carbazides and thiocarbazides can be prepared by
reaction of a hydrazine derivative with phosgene or thiophosgene, or an
equivalent thereof, and hydrazine. Hydrazides are prepared by reaction of
an activated carboxylic acid with hydrazine. A carbonic acid dihydrazine
is prepared by reaction of an isocyanate with carbonic acid dihydrazide.
A hydrazine carboxylate can be prepared by reaction of an alcohol with
phosgene and hydrazine.

The resulting hydrazino or oxyamino carboxylic acid is then 
protected. The protecting group is a salt or is any amine or hydrazine 
protecting group known to those of skill in the art. 

Reaction of the succinimidyl ester or other activated ester of
protected hydrazino or protected oxyamino carboxylic acids, prepared,
e.g., from the carboxylic acid, N-hydroxysuccinimide and
dicyclohexylcarbodiimide, with, e.g., an amino or hydroxy substituted
nucleoside triphosphate provides, after deprotection, the desired
hydrazino or oxyamino substituted nucleoside triphosphate (see, e.g.,
FIGURES 7 and 8).

2. Modified oligonucleotides 

The hydrazino, oxyamino or carbonyl-modified oligonucleotides
provided herein can be synthesized in solid phase or in solution, using the
above hydrazino, oxyamino or carbonyl-modified monomers and other
nucleoside bases. In some embodiments, the hydrazino, oxyamino or
carbonyl-containing oligonucleotides are prepared using enzyme-based
methodology such as PCR, random prime labeling, tailing or nick
translation.

The synthesis of modified polynucleotides containing a modified
monomer described above can also be achieved by enzyme-based
methods. Pyrimidine, purine and deazapurine, preferably cytosine, uracil,
adenine, guanine and thymine, more preferably cytosine, nucleoside
triphosphates containing a hydrazino, oxyamino or carbonyl moiety linked
to the heterocyclic ring can be used as substrates for a wide variety of
purified nucleic acid polymerases of both prokaryotic and eukaryotic
origin. These include Taq DNA polymerase, DNA polymerase I of E. coli,
bacteriophage T4 DNA polymerase, DNA polymerases alpha and beta
from murine (A-9) and human (HeLa) cells, and the DNA polymerase of
Herpes simplex virus. Nick-translation, random prime labeling, and
terminal transferase tailing are also useful methods for the incorporation
of a modified nucleic acid monomer into a polynucleotide.
Nick-translation can be carried out as described in Rigby, et al. (1977) J. Mol.
Biol. 113:237-251, incorporated herein by reference. Random prime
labeling can be conducted utilizing a modification of the method of
Feinberg, et al. (1988) Anal. Biochem. 132:6-13 in which a modified
monomer is used in place of dTTP. Incorporation can be verified by
capture of the probe (or modified oligonucleotide) on DHBHA-Sepharose.
Tailing, or terminal transfer, can be carried out using the method of
Abhay, et al. (1988) Anal. Biochem. 169:376-382 in which a modified
(boronic acid-containing) monomer is diluted into dTTP. As above,
incorporation of the modified monomer can be verified by capture of the
probe on DHBHA-Sepharose.

3. Modified indicators 

The modified indicators provided herein can be prepared by the
following methods, or by other methods known to those of skill in the
art. In general, the commercially available (Amersham, BDL) fluorescent
compound F* is derivatized as an amine, alcohol or thiol, preferably an
amine (see, e.g., FIGURE 6). Reaction of this F* derivative with an
activated carboxylic acid compound, such as an acyl halide, a mixed
anhydride, an aryl or heteroaryl ester, a naphthalimidyl ester, or a
succinimidyl ester, preferably a succinimidyl ester, that is substituted with
a protected hydrazino, protected oxyamino, or carbonyl group, followed
by deprotection, provides the modified indicators of formula F*-M-X
provided herein.

Alternatively, the fluorescent compound F* can be derivatized as
an activated carboxylic acid compound, such as an acyl halide, a mixed
anhydride, an aryl or heteroaryl ester, a naphthalimidyl ester, or a
succinimidyl ester, preferably a naphthalimidyl ester. Reaction of this F*
derivative with an amine, alcohol or carbonyl compound that possesses a
protected or unprotected hydrazino, a protected or unprotected
oxyamino, or carbonyl moiety affords, following deprotection, the
modified indicators provided herein. Certain fluorescent compounds F*
that are derivatized as activated carboxylic acids are known to those of
skill in the art (see, e.g., International Patent Application Publication No.
WO 99/65993, and U.S. Patent Nos. 4,337,063, 4,404,289, 4,405,711,
5,486,616, 5,569,587, 5,569,766 and 5,627,027).

In another alternative, an amino derivative of a fluorescent
compound F* is converted to the corresponding isothiocyanate by
reaction with, e.g., thiophosgene or thiocarbonyldiimidazole, and a
tertiary amine, such as triethylamine or diisopropylethylamine. Reaction
of this isothiocyanate derivative with an amine possessing a protected
hydrazino, protected oxyamino or carbonyl group gives, following
deprotection, the hydrazino, oxyamino and carbonyl modified indicators
provided herein. Alternatively, reaction of the isothiocyanate derivative
with an α,ω-diaminoalkane, such as 1,6-diaminohexane, followed by
reaction with a protected hydrazino, protected oxyamino or carbonyl
modified succinimidyl ester, as described herein, gives, after deprotection,
the hydrazino, oxyamino or carbonyl modified indicators provided herein.

The following examples are included for illustrative purposes only
and are not intended to limit the scope of the invention.

EXAMPLE 1 
Synthesis of aldehyde triphosphate 
5-Amino substituted cytidine triphosphate 5-1 was prepared from
2-deoxycytidine as described in U.S. Patent No. 5,242,796. As shown in
FIGURE 5, triphosphate 5-1 was dissolved in 0.1 M phosphate, 0.15 M
sodium chloride, pH 7.4 and diluted with DMF. A solution of succinimidyl
4-formylbenzoate (5-2; mmol, equiv, Pierce Chemicals, Rockford, IL) in
DMF was added to the triphosphate. The reaction mixture was incubated
at room temperature and the progress of the reaction was followed by C-18
reverse phase HPLC (mobile phase 1: 50 mM triethylammonium
acetate, pH 7.4; mobile phase 2: acetonitrile; gradient: 0% 2 to 20% 2
over 20 min, 20% 2 to 50% 2 over 5 min; flow: 2 mL/min). Further
reagent was added to drive the reaction to completion.
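
The two-segment gradient recited above can be written as a piecewise-linear function of time. The sketch below is illustrative only: the function names are hypothetical, and holding the composition constant before 0 min and after 25 min is an assumption not stated in this example.

```python
# Piecewise-linear gradient recited in Example 1: percent mobile phase 2
# (acetonitrile) versus time. Holding the composition constant outside the
# 0-25 min window is an assumption, not something the example states.
def percent_mobile_phase_2(t_min: float) -> float:
    if t_min <= 0:
        return 0.0
    if t_min <= 20:
        return 20.0 * t_min / 20.0                   # 0% -> 20% over 20 min
    if t_min <= 25:
        return 20.0 + 30.0 * (t_min - 20.0) / 5.0    # 20% -> 50% over 5 min
    return 50.0


FLOW_ML_PER_MIN = 2.0  # flow rate stated in Example 1


def acetonitrile_volume_ml(steps: int = 2500) -> float:
    """Approximate acetonitrile delivered over the 25 min run (trapezoid rule)."""
    dt = 25.0 / steps
    total = 0.0
    for i in range(steps):
        a = percent_mobile_phase_2(i * dt)
        b = percent_mobile_phase_2((i + 1) * dt)
        total += 0.5 * (a + b) / 100.0 * FLOW_ML_PER_MIN * dt
    return total
```

Under these assumptions the composition at 22.5 min is 35% mobile phase 2, and one 25 min run delivers about 7.5 mL of acetonitrile.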

The product 5-3 was isolated by ion-exchange HPLC using DEAE-Sephadex
with a gradient from water (500 mL) to 0.6 M LiCl (500 mL) as
the eluting buffer. The compound was desalted by precipitation from
acetone/MeOH. ³¹P NMR, ¹H NMR and mass spectral data confirmed the
structure of the product.

EXAMPLE 2 

A. Synthesis of N-(6-fluoresceinyl)-N'-(6-aminohexyl)-thiourea 

As shown in FIGURE 6, this compound was obtained by dissolving
fluorescein isothiocyanate 6-1 (FITC) (1.03 g, 2.64 mmol) in 15 ml of
MeOH in a 50 ml oven dried flask. This formed an orange suspension
that was allowed to stir for 5 min at room temperature, after which
1,6-diaminohexane (1.23 g, 10.56 mmol) was added all at once and the
solution instantly went homogeneous. This solution was covered with
aluminum foil and allowed to stir at room temperature for 48 hr, after
which the reaction mixture was evaporated to dryness in vacuo. The residue
was dissolved in a solution of 1:1:3 EtOAc:MeOH:AcOH (4 ml). This
solution was loaded onto a column packed with 61 g of silica. The column
had an inner radius of 2 cm. 500 ml of 1:1:2 EtOAc:MeOH:AcOH was
pushed through the column by pressure and fractions were collected and
analyzed by TLC. The product 6-2 had an Rf of 0.68 and this spot was
collected and evaporated to dryness followed by 5 x 20 ml
co-evaporation with MeOH. Yield: 1 g of an orange solid. 75% yield. Mass
spec, NMR, and RP-HPLC all indicated pure material.
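
The quantities quoted in this example can be cross-checked arithmetically. The sketch below is illustrative only; it takes the molar mass of 6-2 from the amounts quoted for that compound in Example 2.B (174 mg corresponding to 0.344 mmol), which is an inference rather than a value stated in the text, and all variable names are hypothetical.

```python
# Quantities quoted in Example 2.A
fitc_mmol = 2.64
diamine_mmol = 10.56
product_mg = 1000.0            # "1 g of an orange solid"

# Molar mass of 6-2 implied by Example 2.B (174 mg = 0.344 mmol); this is an
# inference from the quoted amounts, not a value stated in the text.
mw_6_2 = 174.0 / 0.344         # g/mol (equivalently mg/mmol)

equiv_diamine = diamine_mmol / fitc_mmol       # excess of 1,6-diaminohexane
theoretical_mg = fitc_mmol * mw_6_2            # FITC is the limiting reagent
percent_yield = 100.0 * product_mg / theoretical_mg

print(f"diamine equivalents: {equiv_diamine:.1f}")
print(f"theoretical yield:   {theoretical_mg:.0f} mg")
print(f"percent yield:       {percent_yield:.0f}%")
```

On these figures the diamine is used in four-fold excess, and 1 g of product corresponds to roughly 75% of theory, in agreement with the reported yield.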

B. Synthesis of N-(6-fluoresceinyl)-N'-(6-(2-hydrazino-5-pyridylcarbonyl)aminohexyl)-thiourea hydrochloride

The product of Example 2.A, 6-2 (174 mg, 0.344 mmol), was
dissolved in 5 ml of dry DMF, and a solution of succinimidyl
2-BOC-hydrazino-5-pyridinecarboxylate (100 mg, 0.344 mmol) in 2 ml of
DMF was added by syringe all at once at room temperature. The reaction
mixture was allowed to stir covered for 24 hr, after which it
was evaporated to dryness on a rotary evaporator. 4 M HCl in dioxane
(40 ml) was added. A precipitate instantly formed. After 4 h, the
precipitate was filtered and washed with 4 x 5 ml dioxane to yield
200 mg of 6-3 as an orange solid. 95% yield. ¹H NMR and mass
spectrometry (calc 641.4; found 641.4) confirmed the structure.

C. Synthesis of N-(6-fluoresceinyl)-N'-(6-(4-formylbenzoyl)aminohexyl)-thiourea

The product of Example 2.A, 6-2 (100 mg, 0.198 mmol), was
dissolved in a 1:2 solution of DMF:100 mM phosphate 1 mM NaCl buffer
(pH 7.5) (10 ml). Succinimidyl 4-formylbenzoate (244 mg, 0.990 mmol) in
4 ml of dry DMF was added by syringe all at once. This solution was
allowed to stir at room temperature for 48 hr, then was evaporated to
dryness on a rotary evaporator. The ensuing residue was treated with 40 ml of
dioxane and allowed to sit overnight in the dark to induce precipitation.
The precipitate was then filtered off and precipitated one more time with
Et2O to yield 110 mg of pure yellow solid 6-4. 84% yield.

EXAMPLE 3

Incorporation of aldehyde triphosphate 5-3 in PCR reaction

Oligonucleotide      Sequence                                          Source
M13Fcomplement5'G    GGT CGT GAC TGG GAA AAC (SEQ ID NO:1)             Trilink
M13F                 GTT TTC CCA GTC ACG AC (SEQ ID NO:2)              Genosys
M13R                 AGC CCA TAA CAA TTT CAC ACA GGA (SEQ ID NO:3)     Genosys
TF3                  AT GGA AGA ATC TCT CC (SEQ ID NO:4)               Genosys
Ferritin283          GGA ACA TGC TGA AAA CTG (SEQ ID NO:5)             Genosys
b-actin300           TCA CCA ACT GGG ACG ACA TG (SEQ ID NO:6)          Genosys

Oligonucleotide suppliers: Genosys, Inc (Woodlands, TX); Trilink
Biotechnologies (San Diego, CA).

The incorporation of aldehyde-dCTP into PCR-generated double-stranded
DNA was performed using Stoffel Taq Polymerase (Perkin-Elmer) with the
buffer provided by the manufacturer and 3.5 mM MgCl2.
Deoxynucleotides dGTP, dATP and TTP were present at 0.6 mM, while
primers M13F and M13R (see sequences above) were used at 1 µM. The
DNA template was plasmid 80109 from the IMAGE Consortium collection
of human cDNAs. The PCR reactions used different amounts of dCTP and
aldehyde-dCTP at a combined concentration of 0.6 mM. Individual reactions
contained 0, 0.06, 0.18 and 0.6 mM aldehyde-dCTP. Synthesis of PCR product
from each reaction was determined by radioincorporation of ³²P into
Qiagen-purified PCR product. The PAGE gel of the radiolabelled products
demonstrates that reactions that included 10% and 30% aldehyde-dCTP
yielded full length product.
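
The split between dCTP and aldehyde-dCTP in each reaction follows from the combined concentration of 0.6 mM. The sketch below is illustrative only; it assumes the four reactions correspond to 0%, 10%, 30% and 100% aldehyde-dCTP, and the name `split_dctp` is hypothetical.

```python
TOTAL_C_MM = 0.6  # combined dCTP + aldehyde-dCTP concentration (mM), per Example 3


def split_dctp(aldehyde_fraction: float, total_mm: float = TOTAL_C_MM):
    """Return (dCTP mM, aldehyde-dCTP mM) for a given aldehyde-dCTP fraction."""
    if not 0.0 <= aldehyde_fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    aldehyde = round(total_mm * aldehyde_fraction, 6)
    return round(total_mm - aldehyde, 6), aldehyde


# Assumed fractions: 0%, 10%, 30% and 100% aldehyde-dCTP.
for frac in (0.0, 0.1, 0.3, 1.0):
    dctp, ald = split_dctp(frac)
    print(f"{frac:>4.0%}: {ald} mM aldehyde-dCTP + {dctp} mM dCTP")
```

On this reading, the 10% and 30% reactions contain 0.06 mM and 0.18 mM aldehyde-dCTP, with unmodified dCTP making up the balance to 0.6 mM.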

EXAMPLE 4

Fluorescent labeling of aldehyde incorporated PCR product

Determination of incorporation of aldehyde-dCTP was performed by
incubation of hydrazino-fluorescein 6-3 with Qiagen-purified PCR products
(from Example 3) in 0.1 M acetate, pH 5.0 for 2 hours at room
temperature. The labeling reaction was terminated by addition of one
volume of 0.1 M Tris-HCl, pH 7.5. The sample was purified twice with a
Qiagen PCR purification kit and eluted into a final volume of 25 µL in EB
buffer provided by Qiagen. Fluorescent signal was assayed by spotting
2 µL of sample onto a microscope slide (Gold Seal, Fisher Scientific) and
analysis using a ScanArray Fluorescent Scanner.

EXAMPLE 5 

Incorporation of aldehyde-triphosphate by a variety of enzymes using a
single-nucleotide extension assay

Oligonucleotide      Sequence                                          Source
M13Fcomplement5'G    GGT CGT GAC TGG GAA AAC (SEQ ID NO:1)             Trilink
M13F                 GTT TTC CCA GTC ACG AC (SEQ ID NO:2)              Genosys
M13R                 AGC CCA TAA CAA TTT CAC ACA GGA (SEQ ID NO:3)     Genosys
TF3                  AT GGA AGA ATC TCT CC (SEQ ID NO:4)               Genosys
Ferritin283          GGA ACA TGC TGA AAA CTG (SEQ ID NO:5)             Genosys
b-actin300           TCA CCA ACT GGG ACG ACA TG (SEQ ID NO:6)          Genosys

Oligonucleotide suppliers: Genosys, Inc (Woodlands, TX); Trilink
Biotechnologies (San Diego, CA)

³²P-labelling: Oligonucleotides were radiolabeled with ³²P using
gamma-³²P-ATP, 3000 Ci/mmol at 10 Ci/mL (ICN, Inc.). The labeling
reaction was performed with 20 picomoles oligonucleotide, 60 µCi
gamma-³²P-ATP, 10 units T4 polynucleotide kinase (Gibco BRL) and 1 10X
reaction buffer provided by the manufacturer. The reaction was
incubated for 20 minutes at 37 °C. ³²P-labeled oligonucleotides were
purified using a Qiagen nucleotide purification kit and samples were eluted
into 50 µL dH2O and stored at -20 °C.
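
The amount of labeled ATP used per reaction follows from the quoted activity and specific activity. The sketch below is illustrative only; it assumes the nominal specific activity applies at the time of use (radioactive decay is ignored), and the function name is hypothetical.

```python
SPECIFIC_ACTIVITY_CI_PER_MMOL = 3000.0   # gamma-32P-ATP, as quoted
ACTIVITY_UCI = 60.0                      # activity per labeling reaction, as quoted
OLIGO_PMOL = 20.0                        # oligonucleotide per reaction, as quoted


def atp_pmol(activity_uci: float, ci_per_mmol: float) -> float:
    """Convert an activity in microcuries to an amount of ATP in picomoles.

    Assumes the nominal specific activity (no correction for decay).
    """
    ci = activity_uci * 1e-6      # uCi -> Ci
    mmol = ci / ci_per_mmol       # Ci -> mmol
    return mmol * 1e9             # mmol -> pmol


atp = atp_pmol(ACTIVITY_UCI, SPECIFIC_ACTIVITY_CI_PER_MMOL)
ratio = atp / OLIGO_PMOL          # moles of ATP per mole of oligonucleotide
```

Under that assumption, 60 µCi at 3000 Ci/mmol corresponds to about 20 picomoles of ATP, i.e. approximately one equivalent relative to the 20 picomoles of oligonucleotide.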
Extension Assay 

Oligonucleotides M13Fcomplement5'G and M13F were combined
at a final concentration of 4 µM along with 8.5 × 10⁶ cpm ³²P-labeled
M13F in a final volume of 16 µL. The mixture was incubated at 80 °C
for 3 minutes and allowed to cool to room temperature. Single nucleotide
extension assays were performed using a variety of enzymes, i.e., T4
DNA polymerase, Klenow, E. coli DNA polymerase I, Stoffel Taq
polymerase, Superscript reverse transcriptase, MMLV reverse
transcriptase, and AMV reverse transcriptase. Reaction conditions used
buffers supplied by the manufacturers. Reactions also contained either
dCTP or aldehyde-dCTP. Extension reactions were stopped by freezing at
-20 °C. Analysis of extension reactions was performed by
polyacrylamide gel electrophoresis on a 12.5% polyacrylamide gel
containing 8 M urea.
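
The design of the primer and template pair can be checked from the sequences tabulated above: M13F anneals to M13Fcomplement5'G leaving a single unpaired 5' nucleotide on the template, which determines the nucleotide added in the extension. The sketch below is illustrative only; the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (spaces ignored)."""
    return seq.replace(" ", "").upper().translate(COMPLEMENT)[::-1]


# Sequences as tabulated in this example, written 5' to 3'.
TEMPLATE = "GGT CGT GAC TGG GAA AAC".replace(" ", "")   # M13Fcomplement5'G (SEQ ID NO:1)
PRIMER = "GTT TTC CCA GTC ACG AC".replace(" ", "")      # M13F (SEQ ID NO:2)


def next_incorporated_base(template: str, primer: str) -> str:
    """Return the first nucleotide a polymerase would add to the annealed primer."""
    site = reverse_complement(primer)
    if not template.endswith(site):
        raise ValueError("primer does not anneal flush with the template 3' end")
    overhang = template[: len(template) - len(site)]
    if not overhang:
        raise ValueError("no template overhang to copy")
    # The template base adjacent to the duplex is the last base of the 5' overhang.
    return overhang[-1].translate(COMPLEMENT)
```

With these sequences the template overhang is a single G, so the single-nucleotide extension incorporates a C, consistent with the use of dCTP or aldehyde-dCTP in the reactions.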

EXAMPLE 6 
Synthesis of hydrazine triphosphate 

5-Amino substituted cytidine triphosphate 7-1 was prepared from
2-deoxycytidine as described in U.S. Patent No. 5,242,796. As shown in
FIGURE 7, triphosphate 7-1 was dissolved in 0.1 M phosphate, 0.15 M
sodium chloride, pH 7.4 and diluted with DMF. A solution of succinimidyl
6-hydrazinonicotinate acetone hydrazone (7-2; 5 equiv, Pierce Chemicals,
Rockford, IL) in DMF was added to the triphosphate. The reaction
mixture was incubated at room temperature and the progress of the
reaction was followed by C-18 reverse phase HPLC (mobile phase 1: 50
mM triethylammonium acetate, pH 7.4; mobile phase 2: acetonitrile;
gradient as in Example 1). Further reagent was added to drive the
reaction to completion. The product 7-3 is isolated by ion-exchange HPLC
using conditions as described in Example 1.

EXAMPLE 7
Synthesis of aminooxy triphosphate

As shown in FIGURE 8, N-hydroxyphthalimide (1
equiv; 8-1; Aldrich Chemical Co., Milwaukee, WI) is dissolved in DMF and
treated with triethylamine (1 equiv) and 5-bromopentanoic acid (1 equiv;
Aldrich Chemical Co.). The reaction mixture is heated at 50 °C and the
progress of the reaction followed by silica gel thin layer chromatography
until complete. The solvent is removed under reduced pressure and the
residue is partitioned between ethyl acetate and 5% citric acid. The
organic phase is washed with brine, dried (magnesium sulfate), filtered
and concentrated to the crude product. The product 8-2 is purified by
silica gel chromatography.

To a solution of acid 8-2 (1 equiv) in THF is added
N-hydroxysuccinimide (1 equiv; Aldrich Chemical Co.) and a solution of
dicyclohexylcarbodiimide (1 equiv; Aldrich Chemical Co.). The reaction mixture is
stirred at room temperature for 4 hours and the precipitate that forms
(dicyclohexylurea) is removed by filtration. The filtrate is concentrated to
dryness and the residue is recrystallized to yield desired succinimidyl ester
8-3.

5-Amino substituted cytidine triphosphate 8-4 (prepared from
2-deoxycytidine according to the procedure of U.S. Patent No. 5,242,796)
is dissolved in 0.1 M phosphate, 0.15 M sodium chloride, pH 7.4 to
prepare a 10 mg/mL solution and diluted with an equal volume of DMF.
A solution of 8-3 (1 equiv) in DMF is added to the triphosphate solution.
The reaction mixture is incubated at room temperature and the progress
of the reaction is followed by C-18 reverse phase HPLC (mobile phase 1:
50 mM triethylammonium acetate, pH 7.4; mobile phase 2: acetonitrile;
gradient as described in Example 1). Further 8-4 is added to drive the
reaction to completion. The product 8-5 is isolated by ion-exchange
HPLC. ³¹P NMR, ¹H NMR and mass spectral data confirmed the structure
of the product.

Phthalimide 8-5 is converted into oxyamino 8-6 by addition of 8-5
to a solution of hydrazinium acetate. The reaction mixture is incubated at
room temperature. The purified product is isolated by ion exchange
chromatography.

Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended 
claims. 